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Preface 


Real analysis is the important branch of mathematics that investigates the properties 
of the real numbers and establishes the theory behind calculus, differential equations, 
probability, and other related subjects. The main concepts studied in real analysis 
are sets of real numbers, functions, limits, sequences, continuity, differentiation, in- 
tegration, and sequences of functions. The study of these topics allows one to gain a 
much deeper understanding of the behavior and properties of real-valued functions, 
sequences, and sets of real numbers. Such depth allows one to really understand why 
the theorems of the calculus are true. 

One of the main goals of a course in real analysis is to cover the proofs that 
were omitted in calculus. Calculus books usually prove a few of the easiest theorems. 
However, if you look carefully, you will see that most calculus textbooks do not 
prove, in the main body of the text, many of the most important theorems in the 
calculus. The Intermediate Value Theorem, the Extreme Value Theorem, and the 
integrability of continuous functions on a closed bounded interval are results all crucial 
to calculus and to higher analysis. Moreover, these theorems can be stated in terms 
that a calculus student can understand, but an actual proof cannot be achieved 
without addressing a fundamental question: What are the important properties of 
the set of real numbers? This text provides an answer to this question. 

Typically, undergraduates view real analysis as one of the most difficult courses 
that a mathematics major is required to take. The main reason for this perception is 
twofold: One must comprehend new abstract concepts and learn to deal with these 
concepts on a level of rigor and proof not previously encountered. In particular, for 
many of these students, the mental gymnastics required to prove theorems about 
limits is initially formidable. They struggle to find the appropriate values (e.g., 6 or 
N) that are necessary to compose a logically correct proof concerning the limit of a 
function or a sequence. This text offers a resolution to this difficulty. The book not 
only presents the central theorems of real analysis, but also shows the reader how to 
compose and produce the proofs of these theorems. This approach should be to the 
benefit of any undergraduate reader. 

In real analysis, a facility for working with the supremum of a bounded set, the 
limit of a sequence, and the «-d definitions of continuity are essential for a student to 
be successful. This book is designed to show students how to overcome their initial 
hurdles and gain such a facility. I present proof strategies that explicitly show students 
how to deal with the fundamental definitions that one encounters in real analysis; 
each of which is followed by numerous examples of proofs that use these strategies. In 
Chapters 2—4, many of the introductory proofs are preceded by a “proof analysis” that 
carefully explains how to apply these strategies. The proof analysis is then followed 


ix 


xX H Preface 


by the actual proof. When a real analysis instructor presents a proof in class, these 
strategies can also help to expedite a student’s understanding of the proof and its 
logical structure. 


Roadmap to the Book 


The book presents a mathematical theory that validates the calculus of a single vari- 
able. Moreover, the book is designed for students who are learning the rudiments of 
this theory for the first time. Students typically stumble when first asked to compose 
proofs concerning the core concepts covered in real analysis. To assist such students, 
we present specific strategies for each core concept that are designed to guide a stu- 
dent in the discovery of a correct proof. In fact, these strategies will be appreciated 
by students at all levels of preparation. 

Chapter 1 reviews some of the important topics that a student usually learns in 
a discrete mathematics course and/or a transition to advanced mathematics course. 
This chapter is intended to act as a reference for the basic mathematical concepts 
that will appear in the remaining chapters of the book. 

Chapter 2 presents the properties of the real number system. The first two sections 
deal with the algebraic and order properties of the real numbers and the crucial 
completeness property. In this chapter, we identify strategies that are designed to 
give the reader a guide on how to prove theorems that concern the supremum and 
infimum of a nonempty bounded set of real numbers. The chapter also shows how the 
completeness axiom implies the Archimedean Property,! the density of the rational 
numbers, and the nested intervals theorem. After delivering a succinct introduction 
to countable sets, we prove that the set of real numbers is uncountable. The argument 
is based on Cantor’s first proof of this result. 

Chapter 3 focuses on sequences of real numbers. The limit of a sequence is one 
of the most important concepts in mathematical analysis. Our initial emphasis is to 
identify the strategies that are used to prove limit theorems. The attention then turns 
to subsequences, monotone sequences, Cauchy sequences, and the limit superior and 
limit inferior of a bounded sequence. 

Continuity is the central topic covered in Chapter 4, which begins on the principal 
technique that is used to prove that a function is continuous at a point. This is 
followed by the fundamental theorems on continuity. Then the sequential criterion 
for the continuity of function is covered. This is followed by a section on the limit 
of a function and its sequential equivalent. The chapter ends on the topic of uniform 
continuity. 

In Chapter 5, we present a theoretical treatment of the derivative and derive its 
principal properties. We also identify and establish the following three significant 
results that concern the derivative: the Mean Value Theorem, Cauchy’s Mean Value 
Theorem, and Taylor’s Theorem. We also prove four versions of L’H6pital’s Rule, 
using the sequential criterion for limits. 


'This property is named in honor of Archimedes, a great mathematician who is also regarded 
as one of the leading scientists in classical antiquity. 
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The Riemann integral is the main subject of Chapter 6. We follow the development 
of Jean Gaston Darboux, which simplifies Riemann’s original idea. In this chapter, 
we show that continuous functions, monotone functions, and functions of bounded 
variation are Riemann integrable. After proving the two versions of the Fundamental 
Theorem of Calculus, we establish three techniques of antidifferentiation, the latter 
of which validates integration by trigonometric substitution and Weierstrass substi- 
tution. The chapter ends with a discussion on improper integrals. 

Using results from Chapter 3 on the convergence of sequences and on the limit 
superior of a sequence, Chapter 7 confirms the important theorems that concern an 
infinite series of real numbers. The chapter ends by establishing results on the effect of 
regrouping and rearranging the terms of a series. Finally, in Chapter 8, we investigate 
sequences and series of functions, including power series and Taylor series. 

In Appendix A, we provide a technical proof of a theorem on the integrability 
of a composite function. In Chapter 6, we just state and apply this useful theorem. 
Appendix B covers the topology of the real numbers, including compactness and the 
Heine—Borel Theorem. The appendix ends with a section on the Cantor set and some 
of its remarkable properties. The material in this appendix is not used in the text 
itself, but is included to present the reader with an optional topic that often appears 
in more advanced courses in analysis. It has been my experience that students have 
considerable difficulty with topological concepts. Such concepts tend to make a first 
course in real analysis more challenging than it needs to be. Since topology is not 
required to prove the theorems in this text, this topic is presented in an appendix. 
Finally, Appendix C provides a brief review of logic and the basic proof strategies 
that are typically covered in an introduction to proof course. 

Exercises are given at the end of each section in a chapter and in Appendix B. 
Exercise notes often appear at the end of an exercise set. The notes offer suggestions 
for the more challenging problems and comments that relate to specific exercises. An 
exercise marked with an asterisk * is one that is cited, or referenced, elsewhere in the 
book. A referenced exercise without an adjoined page number appears in the exercise 
set below the reference. 


How to Use the Book 


The text assumes that the reader has completed a three-term calculus sequence and 
recalls how to differentiate and integrate the typical functions that appear in such a 
course sequence. It is also assumed that the reader is familiar with the standard tech- 
niques of proof; that is, the reader should know how to read and write a mathematical 
proof. These techniques are covered in a “transition course,” which is typically a pre- 
requisite for a course in real analysis. Nevertheless, my primary goal was to write 
a book for a reader who may not be sufficiently well-versed in the proof techniques 
that are often applied in real analysis. Consequently, each proof presented in the book 
favors detail over brevity. However, this detail is intended mainly for the reader and 
it can be abridged in a lecture setting. 

Depending on the student’s background, Chapter 1 can be briefly discussed in 
class or given as assigned reading. However, on the first day of class, one should 
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discuss Sections 1.1.2, 1.1.3, 1.1.4, 1.1.5, and in particular the Sum and Product 
Principles of Inequality in Section 1.1.8. A one-semester course that ends with the 
Fundamental Theorem of Calculus can then present Chapters 2 through 5 together 
with Sections 6.1, 6.2.1, and 6.2.3 followed by Theorem 6.3.1, Corollary 6.3.2, and 
Theorems 6.4.2 and 6.4.4. On the other hand, one could thoroughly cover the first five 
chapters, and then pursue the remaining parts of the book, including Appendix B, 
in a second semester. In any case, Sections 2.5, 5.2.1, 6.3.3, 6.4.3, and 7.3 can be 
omitted without loss of continuity. Section 3.8 should be discussed if covering Sec- 
tions 7.2.5, 8.3, and 8.4. 


To the Student 


As an undergraduate, I found my first real analysis course with its focus on proof to 
be very difficult. I thought that I might fail the course. But, in the end, I passed with 
a good grade. The reason for this is perfectly described by the following quote. 


Patience and perseverance have a magical effect before which difficulties 
disappear and obstacles vanish. — John Quincy Adams 


This experience as an undergraduate inspired me to write this book, which is designed 
to increase your confidence by providing you with a guide for finding and writing 
proofs in real analysis. This guide involves a so-called “proof diagram.” Such a guide 
is illustrated in Section 1.1.5. A proof diagram demonstrates the structure of the proof 
and provides a tool for writing a correct mathematical proof. Even with a guide, the 
work required to find a proof can be quite challenging. Professional mathematicians 
also have difficulty finding proofs; however, as I learned in my first real analysis 
course, persistence often pays off and thus, they do not easily give up. 

To be successful in this course, review the material in Chapter 1 and Appendix C, 
even if it is not explicitly covered in class. Tenaciously read and understand every 
section, lemma, theorem, corollary, and proof discussed from the text and diligently 
work on the assigned exercises. The only way to acquire a deep understanding of 
real analysis is through the exercises. Take advantage of the exercise notes that are 
provided for many of the exercises. Use the proofs provided in the text as models for 
your own proofs. Follow this advice and you, too, can pass with a good grade. 
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CHAPTER | 


Proofs, Sets, Functions, and 
Induction 


In this introductory chapter, we review and identify the preliminaries that are essen- 
tial for real analysis. In particular, we review inequalities, sets, functions, and proof 
by mathematical induction. These preliminary topics should be familiar, as should 
be the basic proof techniques that are used in mathematics. A review of logic and 
proof is presented in Appendix C on page 253. The most important proof strategies 
that are applied in this text can be found in the appendix starting on page 256. 


1.1 PROOFS 


1.1.1. Important Sets in Mathematics 


The set concept is frequently used in mathematics. A set is a well-defined collection 
of objects. The items in such a collection are called the elements or members of the 
set. The symbol “€” is used to indicate membership in a set. Thus, if A is a set, 
we write x € A to declare that x is an element of A. Moreover, we write x ¢ A to 
assert that x is not an element of A. In mathematics, a set is typically a collection 
of mathematical objects, for example, numbers, functions, or other sets. Certain sets 
are frequently used in mathematics. The most commonly used ones are the sets of 
natural numbers, integers, and rational and real numbers. These sets will be denoted 
by the following symbols: 


1. N= {1,2,3,...} is the set of natural numbers. 

2. Z={...,—3,—-2,—-1,0,1,2,3,...} is the set of integers. 
3. Q is the set of rational numbers. So, 3 EQ. 
4 


. R is the set of real numbers and so, 7 € R. 


In this text, we do not consider 0 to be a natural number. 

A set can sometimes be identified by enclosing a list of its elements by curly 
brackets; for example, the set A = {1,2,3,4,5,6,7,8,9} consists of a few natural 
numbers. More often, one forms a set by enclosing a particular expression within 
curly brackets, where the expression identifies the elements of the set. To illustrate 
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this method of identifying a set, we can form a set B of even natural numbers, using 
the above set A, as follows: 


B={ne€éA:n is even} (A) 


which is read as “the set of n € A such that n is even.” Clearly, B = {2,4,6, 8}. 
So using the set A and the property “n is even,” we formed the set B in (A). More 
generally, given a set C' and a property P(x), we can form the set X = {x € C: P(z)}. 
Thus, X is the set of all objects in C that make P(x) true. In the definition of X, 
the set C' will be referred to as the restriction and X will be called a restricted truth 
set. Moreover, when the restriction is understood or unspecified, we will only write 
X = {a : P(x)} and refer to X as a truth set. 


Example. Consider the set of integers Z. We evaluate the following sets: 


1. {a € Z: x is a prime number} = {2,3,5,7,11,...}. 
2. {cx €Z:z is even} = {..., —8, —6, —4, —2,0,2,4,6,8,... }. 
3. {2€Z: 24 <1} ={-1,0,]}. 
For each of the sets Z, Q, and R, we may add ‘+’ or ‘—’ as a superscript. The 


+ (or — ) superscript indicates that only the positive (or negative) numbers will be 
allowed. For example, QT = {x €Q:x>0} and R- ={reER: x2 < 0}. 


Interval Notation 


A point is a term that will be used to denote a real number. 


Definition 1.1.1. An interval is a set J of real numbers that has at least two points 
and for any two points x,y € J, every real number between zx and y is also in J. 


Of course, the set consisting of all the real numbers that lie between two real 
numbers is an interval. We now identify all of the possible forms of an interval. 


. The open interval (a, b) is defined to be (a,b) = {x ER: a<a2< bd}. 

. The closed interval [a, }] is defined to be [a,b] = {c € R: a< x < Dd}. 

. The left-closed interval [a, 6) is defined to be [a,b) = {x € R:a< a < bd}. 

. The right-closed interval [a, b) is defined to be (a,b] = {x € R:a< a2 < bd}. 

. The interval (a, 00) is defined to be (a,00) = {x € R: a < z}. 

The interval [a, co) is defined to be [a,o0) = { ER: a< a}. 
(—oo, b) is defined to be (—oo, b) = {x € R: « < Db}. 

. The interval (—oo, }] is defined to be (—oo, b] = {x € R: x < 5b}. 
(— 


. The interval 


The interval 


oo, co) is R, the set of all real numbers. 


The numbers a and 8, in items 1-8, are called the endpoints of the interval and a is 
called the left endpoint and 6b is said to be the right endpoint. A point in an interval 
that is not an endpoint is an interior point. An interval may or may not include its 
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endpoints. The symbol oo denotes “infinity” and is not a number or an endpoint. The 

notation oo is a useful symbol that allows us to represent intervals that are “without 

an end.” The notation —oo is used to denote an interval “without a beginning.” 
Some sets of real numbers, but not all, can be expressed in terms of an interval. 


Example. We evaluate the sets (1) {x € R: x? — 1 < 3} and (2) {tc ER” : x > 4} 
in terms of an interval as follows: 


(1) Solving the inequality 2? — 1 < 3 for x”, we obtain x? < 4. The solution to this 
latter inequality is —2 < x < 2. Thus, {r €R: 2? -1 < 3} = (—2,2). 

(2) We need to find all the real numbers x < 0 that satisfy 7 > 1, We conclude that 
a? <1. Thus, -1<2<0. So, {c€R™ : x > +4} =(-1,0). 


Definition. A positive rational number ™ is in reduced form ifm € N and n ¢ N 


have no common factors greater than 1. 


Example. The rational number ; is in reduced form and a is not in reduced form 


because 7 and 14 have a common factor greater than 1. However 
reduced form. 


(an ‘ Perec 
, 14 = 3 Which is in 


As illustrated in the above example, each positive rational number can be ex- 
pressed in terms of a ratio in reduced form. 


Lemma 1.1.2. Let a,b € Z. If pis a prime and p divides ab, then either p divides a 
or p divides b. 


Theorem 1.1.3. Let p € N be a prime number. Then ,/p is an irrational number. 


Proof. Let p € N be a prime. Assume, for a contradiction, that \/p is rational. Thus, 
/p = * for souls m,n € N where n 4 0 and ™ is in reduced form. Since \/p = ™, we 
have that p= ™> and (x) m? = pn. Hence, p evenly divides m?. Since p is a prime, 
p evenly divides m by Lemma 1.1.2. So, m = pk for some k € N. After substituting 
m = pk into (x), we obtain p2k? = pn?. Therefore, n? = pk?. Thus, p evenly divides 
n? and so, p evenly divides n. Hence, m and n have p as a common factor. It follows 


that ae is not in reduced form. Contradiction. 


1.1.2 How to Prove an Equation 


Equations play an important role in mathematics. In this text we will establish many 
theorems that require us to correctly prove an equation. Since this is so fundamental, 
our first proof strategy presents two correct methods for proving equations. 


Proof Strategy 1.1.4. To prove a new equation y = w, there are two approaches: 


(a) Start with one side of the equation and derive the other side. 


(b) Perform operations on any given equations to derive the new equation. 
We now apply strategy 1.1.4(a) to prove a well-known algebraic identity. 


Theorem. Let x and y be arbitrary real numbers. Then (x + y)(x — y) = 2? — y?. 
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Proof. We start with the left hand side (x + y)(x— y) and derive the right hand side 
as follows: 


(c+ y)(x — y) = x(x —y) + y(x—y) by the distributive property 
a ry + yx — y? by the distributive property 
= 7? — y? by algebra. 


Therefore, (x + y)(x — y) = x7 — y’. 
We now apply strategy 1.1.4(b) to prove an equation from some given equations. 


Theorem. Let x, y,i,7 be real numbers. Suppose that « = 27+ 5 and y = 37. Then 
xy = 607 + 157. 


Proof. Assume that x = 2i+5 and y = 27. By multiplying corresponding sides of these 
two equations, we obtain ry = (27 + 5)(37). Thus, by algebra, ry = 617 + 157. 


Remark 1.1.5. To prove that an equation y = w is true, it is not a correct method 
of proof to assume the equation y = w and then derive an identity. 


The method described in Remark 1.1.5 is invalid and, if applied, can produce false 
equations. For example, this incorrect method can be used to deduce the equation 
—2 = 2. To illustrate this, let us assume the equation —2 = 2. Now square both 
sides, obtaining (—2)? = 2? which results in the true equation 4 = 4. The method 
cited in Remark 1.1.5 would allow us to conclude that —2 = 2. This is nonsense. In 
mathematics, one never applies a method that can produce false results! 


1.1.3 How to Prove an Inequality 


Many of the proofs in real analysis involve working with inequalities. In this section, 
we review the properties of inequality that are frequently applied in analysis. As you 
may recall, to prove a new inequality from some given inequalities is typically more 
difficult than proving an equation. The major reason for this added difficulty: one 
has to correctly use the Laws of Inequality. 


Laws of Inequality 1.1.6. For all a,b,c,d € R, the following hold: 


1. Exactly one of the following holds: a < b ora =bora> b. (Trichotomy) 
2. Ifa<bandb<c, thena<ce. (Transitivity Law) 
3. Ifa<b, thena+c<b+c. (Adding on both sides) 
4. Ifa<bandc>0, then ac < be. (Multiplying by a positive) 
5. Ifa<bandc<0, then ac > be. (Multiplying by a negative) 
6. ifa<bandc<d,thena+c<b+d. (Additivity) 


We write a > b when 6 < a, and a < Bb states that a < b or a = BD. Similarly, 
a > b means that a > b or a = b. The Trichotomy Law allows us to assert that if 
a <b, then a > b. We note that one can actually prove laws 5 and 6 from laws 1 to 
4. Moreover, one can also prove that 0 < 1 and —1 < 0. 
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Theorem 1.1.7. Let a,b,c be real numbers where a < b. Then a—c < b—c. 


Proof. Let a,b,c be real numbers such that a < b. From the inequality law 3, we 
obtain a + (—c) < 6+ (—c). Thus, by algebra, we conclude that a—c < b-—c. 


The following principles of inequality are based on the Laws of Inequality 1.1.6 
and are frequently applied in real analysis. 


Sum and Product Principles of Inequality 1.1.8. Let a, p, x, y be real numbers. 
Then the following hold: 


(1) Given the sum a+ 2, if x < y, then you can conclude that a+ 2 <a+y. 
(Replacing a summand with a larger value yields a larger sum.) 


(2) Given the sum a+ 2, if x > y, then you can conclude that a+ 2 >a+y. 
(Replacing a summand with a smaller value yields a smaller sum.) 


(3) Given the product px where p > 0, if x < y, then you can infer that px < py. 
(Replacing a factor with a larger value yields a larger product.) 


(4) Given the product px where p > 0, if x > y, then you can infer that px > py. 
(Replacing a factor with a smaller value yields a smaller product.) 


Principle (1) holds for < as well (ie., upon replacing both occurrences of < in (1) 
with <). The above (2) also holds for >. Moreover, (3) holds for < when p > 0; and 
(4) holds for > when p > 0. 


We give two proofs of Theorems 1.1.9 and 1.1.10 below. Each first proof uses the 
Laws of Inequality 1.1.6, while each corresponding second proof uses the Sum and 
Product Principles of Inequality 1.1.8. Of these two proofs, which is easier to follow? 


Theorem 1.1.9. Let a,b,c,d be real numbers and suppose that a < 6 and ¢ < d. 
Then a+c<b+4d. 


First Proof. Let a,b,c,d be real numbers satisfying (A) a < b and (@) c < d. We 
prove that a+c < b+d. From (A) and law 3 of the Laws of Inequality 1.1.6, we 
obtain a+ c<b+c. From (@) and law 3 again, we conclude that b+ c < b+d. So, 
at+c<b+c<b-+d. Therefore, a+c<b+d. 


Second Proof. Let a,b, c,d be real numbers satisfying a < b and c < d. We show that 
a+c<b+das follows: 


at+e<b+c asa<b (see 1.1.8(1)) 
<b+d asc<d (see 1.1.8(1)). 


Therefore, a+c<b+d. 


Theorem 1.1.10. Suppose a and 6 are real numbers. If 0 < a < b, then a? < b?. 


First Proof. Assume 0 < a < b. We show that a? < 6”. Multiplying both sides of the 
inequality a < b by the positive a yields the inequality (a) a? < ab, and multiplying 
both sides of the inequality a < b by the positive b gives the inequality (@) ab < 0. 
Thus, (4) and (@) imply a? < ab < b?. Thus, a? < b?. 
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Second Proof. Assume 0 < a < b. We show that a? < b? as follows: 


a” = aa by algebra 


< ab as a < b (see 1.1.8(3)) 
<bb=b? asa<b (see 1.1.8(3)). 


Therefore, a? < b?. 


In elementary mathematics we learned that for any real number x > 0, there is a 
unique real number y > 0, such that y? = x. We write y = \/x and say that y is the 
square root of x. Consequently, (,/z)? = x. Our next theorem shows that the square 
root operation preserves the inequality relation < for positive numbers. 


Theorem 1.1.11. Suppose a and b are real numbers. If 0 < a < b, then \/a < Vb. 


Proof. Suppose 0 < a < b. We will prove that /a < Vb. Suppose, for a contradiction, 
that Vb < Ja. If Vb = Va, then b = (Vb)? = (\/a)? = a. Contradiction. If Vb < Va, 
then b = (Vb)? < (\/a)? = a by Theorem 1.1.10, and so b < a. Contradiction. 


One can also prove the following extension of Theorem 1.1.10 and Theorem 1.1.11 
(see Exercises 3 and 7 on pages 26 and 27). 


Theorem 1.1.12. If 0< a<_}, then a” < b” and ce bn, for alln EN. 
We end this section by establishing two fundamental properties of inequality. 
Theorem 1.1.13. Let a,b,c,d > 0. If a < band c < d, then ac < bd. 


Proof. Let a,b,c, d be positive real numbers satisfying (A) a < b and (@) c < d. From 
(A) we conclude that ac < bc because c > 0. From (@) we obtain bc < bd as b > 0. 
So, ac < bc < bd. Therefore, ac < bd. 


Corollary 1.1.14. Suppose a,b,x,y > 0. Ifa < band x < y, then ax < by. 


Proof. Let a,b,x,y > 0, a <b, and x < y. We prove that ax < by. There are several 
cases to consider. If a = b and x = y, then ax = by and so, ax < by. If a = b and 
x <y, then ax < ay = by and so, ax < by. Ifa < band x = y, then ax < bx = by and 
so, ax < by. Ifa < band x < y, then ax < by by Theorem 1.1.18. So, ax < by. 


1.1.4 Important Properties of Absolute Value 


We shall now review the absolute value function and some of its properties. This 
function will be more formally discussed is Section 2.2.1. Given a real number 2, the 
absolute value of x, denoted by |z|, is defined by 


For all a,b,2z,c € R, where c > 0, we have (see Theorems 2.2.6 and 2.2.7) 
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> 


a+ b| < jal + [OI 
Ja] — ||| < |a—5| 


z| < cif and only if -c<a4<c 


x| > cif and only if <—corr>c 


= 


ae b| < b 
x < |x| and —z < |z| 8. |a| — |b] < |a—5| 


ab| = |a||b| 9. |b] — |a| < |a — 4]. 


OR ey ee 


Item 6 is called the triangle inequality, and we shall refer to items 7-9 as the 
backward triangle inequality. 


1.1.5 Proof Diagrams 


In Section 1.4.1 we introduce a diagram that encapsulates proof by mathematical 
induction (see page 23). This diagram is obtained by identifying the logical structure 
of proof by induction. One can use the diagram as a guide to produce a correct 
proof by mathematical induction. All mathematical proofs can be analyzed in terms 
of the logical structure of the statement to be proven. This structure allows one to 
“diagram” a mathematical proof. The proof diagram demonstrates the structure of 
the proof and provides a tool for writing a correct mathematical proof. 

We now illustrate how a proof diagram can be used to write a proof that concerns 
inequalities and the absolute value function. As will be seen, the proof applies the 
Sum and Product Principles of Inequality 1.1.8. 


Theorem. For every ¢ > 0, there exists a 6 > 0 such that if |2—1] < 6, then 
|(3a + 2) —5| <e. 


Before proving the above theorem, we present a “proof analysis” which discusses 
how to obtain the necessary values for a correct proof. This analysis is not part of the 
proof, but it does provide all the necessary ingredients that are needed to compose a 
valid proof. 


Proof Analysis. The statement of the theorem has the logical form 


(Ve > 0)(45 > 0)(la — 1] < 5 > |(3e +2) —5] <e). 


Given this logical form, by applying the strategies outlined on page 256, we obtain 
the following “proof diagram:” 


Let ¢ > 0. 
Let 6 = (the positive value you found). 
Assume |x — 1| < 6. 
Prove |(3z + 2) — 5] <e. 


The indentation indicates the proof’s logical dependencies. So we have ¢ > 0, and we 
need to find a 6 > 0 so that if |x — 1| < 6, then |(3x + 2) — 5| < e. Using algebra on 
the expression |(3z + 2) — 5], we extract out |x — 1| as follows: 
|(3a + 2) — 5| =|32 —3]| _ by algebra 
= 3|x— 1] by algebra and property of absolute value. 
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Thus, if we have that 3|a — 1] < ¢, we can conclude that |(37 + 2) — 5| < e. Solving 


the inequality 3|2 —1| < e for |x — 1], we obtain |x — 1| < §. So, we will let 6 = 


€ 
3° 


We can now compose a correct proof, using the preceding proof diagram as a guide. 


Proof. Let ¢ > 0. Let 6 = = Assume (x) |z — 1| < 6. We prove that |(8x2 + 2) — 5| <e 


as follows: 


Therefore, |(32 + 2) — 5| < e. 


|(3a + 2) — 5| = |3(@ — 1)| _ by algebra 
=3|x2—-—1| _ by property of absolute value 


< 36 by (x) and property of inequality 
E 
=05 = since J = 5. 


The above analysis and proof illustrates a theme that will reoccur in the text. We 


will be presenting proof diagrams that explicitly deal with the core definitions and 
proofs that are an essential part of real analysis. Such proofs will be preceded by a 
“proof analysis” and then followed by the actual proof. The reader will thus see how 
to develop and execute a plan of attack for finding and composing a correct proof. 


Exercises 1.1 


OU 


. Let x and y be real numbers. Prove that (x — y) 


. Let y be the positive real number satisfying ? — y—1 = 0. Prove that py = —. 


x’ +ayty’)=22—y?. 


1 ( 

2. Let x and y be real numbers. Prove that (x + y)(x? — ry + y?) = 2? + y°. 
3. 
4 


Let x and y be real numbers. Prove that (a + y)? = x? + 2xy + y”. 


. Using Exercise 3, prove that (x+y)? = 23 +327y+3zry?+y? for all real numbers 


x and y. 


1 
p-1" 


. Let y be as in Exercise 5. Let a ¥ b be real numbers satisfying 8 = y. Prove 


that = = ¢. 


. Let c< 0. Solve —2cx — 10 > —2c — 10 for x. 


. Let c< 1. Solve a >ct+1 for zx. 

. Let x be a real number such that x2 > 1. Prove that x? > x. 

. Let x be a real number where x < 0. Prove that x? > 0. 

. Let x be a real number where x > 0. Prove that x? > 0. 

. Let x be a nonzero real number. Using Exercises 10 and 11, prove that x? > 0. 
. Let a and b be distinct real numbers. Using Exercise 12 prove that a?+ b? > 2ab. 
. Let x be a real number so that x? > x. Must we conclude that x > 1? 


. Let x be a real number satisfying 0 < x < 1. Prove that 2? < z. 


16. 
17. 
18. 


19. 


20. 


21. 


22. 


23. 
24. 
25. 


26. 
27. 
28. 


29. 
30. 
31. 
32. 
33. 
34. 


35. 
36. 
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Let x be a real number where x? < x. Must we conclude that 0 < x < 1? 
Let a and 6 are real numbers where a < b. Prove that —a > —b. 


Let a,b be positive real numbers and let c,d be negative real numbers. Suppose 
a<bandc<d. Prove that ad > bc. 


Find a counterexample showing that the following conjecture is false: Let a, b, c, d 
be natural numbers satisfying z < oe Then a<candb< d. 


Find a counterexample showing that the following conjecture is false: Let m > 0 
and n > 0 be integers. Then m+n <m-n. 


Find a counterexample showing that the following conjecture is false: Let x > 0 
and y > 0 be real numbers. Then \/x + y = /x+ /y. 


Let a,b,c,d be real numbers. Suppose that a+ 6=c+danda<_c. Prove that 
d < b. [Hint: x < y if and only if x—y <0] 
Let a > 0 be a real number. Prove that 4 > 0. 


Suppose that 0 < a < b. Prove that : < t. 


Let x and y be real numbers where x > 0. Using Exercise 23, prove that if xy > 0, 
then y > 0. 


Let 6 > 0. If |x — 1| < 6, then [3x + 5| < 36 +8. 
Prove that if |x — 1] < 2, then 4 < |x + 5]. 


Prove that for all ¢ > 0 there exists a 6 > 0 such that if |z+1| < 6, then 
\(2z + 5) —3| <e. 

Let x be a real number. Prove that 0 < «+ |z| < 2|z]. 

Let 6 > 0. Prove that if |a — 5| < %, then |3x2 — 15| <6. 

Let 6 > 0. Prove that if |x — 5| < 6, then |x +3] <6+8. 

Prove that if |z+5| <1, then 1 < |x +3}. 

Let 6 > 0. Prove that if |z — 3] < 6, then |x? — 9| < 6(6 + 6). 

Prove that for every real number x > 3, there exists a real number y < 0 such 
that «= at 


Prove that for all real numbers x, if « > 1, then 0 < 4 <1, 


Evaluate each set in terms of an interval: 
(a) {c ER*: a> 4} (c) {c ER*:2>4 and z> 2} 
(b) {c €R7: a? > 4} (d) {c ER”: a> anda % —5}. 
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1.2 SETS 


Many of the most important ideas in modern mathematics are expressed in term of 
sets. Thus, it is good to have a basic understanding of sets. In this section, we will 
review a few elementary facts about sets. A set is just a collection of objects. Again, 
these objects are referred to as the elements of the set. We will usually use upper 
case letters to denote sets and lower case letters to denote elements of a set. 


1.2.1. Basic Definitions of Set Theory 


Definition 1.2.1. The following set notation is used throughout mathematics: 


1. For sets A and B, we write A = B when both sets have exactly the same elements. 


2. For sets A and B, we write A C B when A isa subset of B; that is, every element 
of A is also an element of B. 


3. For sets A and B, we write A C B to state that A is a proper subset of B; that 
is, AC Band AF B. 
4. We write @ for the empty the set, that is, the set with no elements. 


5. If A is a finite set, then #A denotes the number of elements in A. 


We are familiar with the following subset relationships NC ZC QCR. 


1.2.2 Set Operations 


The language of set theory is used in the definitions of nearly all of mathematics. 
There are three important and fundamental operations on sets that we shall now 
discuss: the intersection, the union, and the difference of two sets. We illustrate these 
three set operations in Figure 1.1 using Venn diagrams, where shading is used to 
identify the result of each set operation. Venn diagrams are geometric shapes that 
can be used to depict sets and their relationships. 


Definition 1.2.2. Given sets A and B, we can construct new sets using the following 
set operations: 


1. AUB={x:x2¢€Aorz€ B} is the union of A and B. 
2, ANB={x:x¢€ Aand z € B} is the intersection of A and B. 
3. A\ B={x:xe€ Aandz ¢ B} is the set difference of A and B. 


The operation A \ B is often stated as “A minus B.” We will be mainly applying 
Definitions 1.2.1 and 1.2.2 to sets of real numbers. 


Example. Let A= {1,2,3,4,5,6} and B = {2,4,6,8,10,12}. Then 


1. AUB = {1,2,3,4,5,6,8, 10, 12}. 3. A\ B= {1,3,5}. 
2. AN B= {2,4,6}. 4. B\ A= {8, 10,12}. 
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1. Shaded area depicts AU B 2. Shaded area depicts AN B 


A B 
3. Shaded area depicts A \ B 


Figure 1.1: Set operations. 


1.2.3. Indexed Families of Sets 


Given a property P(x), recall that we can form the set {x : P(«)} when the restriction 
is understood. However, there is another way to construct a set. For example, consider 
the set S' of all even natural numbers, that is, the set of all integers of the form 2n 
for some natural number n. We can define S' in two ways: 


1S ={27Gn€N) (2 =2n)} = {2,4,6,8,12,~-2}. 
2. S={2n:neN} = {2,4, 6,8, 12,---}. 


In item 1, we have expressed S as a truth set. Item 2 offers an alternative method 
for constructing the exact same set S. This alternative method is a special case of 
the following technique for constructing sets from the set N of natural numbers. 
Suppose for each i € N, that 0; is some specific object. Then we can form the set 
S' = {o; : i € N} of all such objects. In this case, the set N is said to be the index set 
and S' is called an indexed set. As this concept is often used in mathematics, we now 
formulate this idea in terms of a general definition. 


Definition 1.2.3. let J be any set and for each z € I, let 0; be some specific object. 
Then we can form the set S = {0; : 7 € I} where J is referred to as the index set and 
S is called an indexed set. 


For example, for each 7 € Q we have the number cos(7). Thus, we can construct 
the indexed set {cos(z) : i € Q}. 


Problem. Explain what the following statements mean. 

1. y€ {cos(t) : 7 € Q}. 

2. {4:1 EI} CA, 

3. {4 :t EIS ZA. 

Solution. The first statement y € {cos(z) : 7 € Q} means that y = cos(z) for some 


i € Q. The second statement {x; : i € I} C A means that x; € A for every i € J. 
Finally, the third statement {z; :i € J} Z A means that x; ¢ A for some i € I. 
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Definition 1.2.4. A set #, whose elements are sets, is a family of sets. 


Definition 1.2.5. Let J be a set and for each 7 € I, let C; be a specific set. Then we 
can form the set F = {C; : 7 € I}, where J is the index set and F is an indexed 
family of sets. 


Example. Suppose for each natural number n, we define the set A, = {0,1,2,...,}. 
Then F = {A, :n € N} = {Aj, Ag, As,... } is an indexed family of sets. 


Example 1.2.6. For each real number x > 0, let B, = (—xz,2 + 1). We now can 
define an indexed family of sets by F = {B,: x € R*}. 


Example 1.2.7. Let J = {i € R: i> 1}. For each real number i € J, let B; = [—i, §]. 
Thus, F = {B;: 7 € I} is an indexed family of sets. 


1.2.4 Generalized Unions and Intersections 


For any two sets A and B, we can form the union AU B and the intersection AN B of 
these sets. In mathematics, one often forms the union and intersection of many more 
than just two sets. We will now generalize the operations of union and intersection 
so that they apply to more than just two sets. We first extend the notions of union 
and intersection to a finite number of sets, and then to any collection of sets. 

We know that « € AUB means that z is in at least one of the two sets A and B. 
This notion of union can be easily extended to more than two sets. For finitely many 
sets, say A, Ao,... An, we say that x is in the union 


A, U Ag U++-UAy 


when z is in at least one of the sets A,, Ao,... An; that is, x € A; for somel <i<n. 
Using I = {1,2,...,n} as an index set, we write 


J A; = Ai U Ap U+++U An 
tel 
and so, x € (J A; means that x € A; for some i € J. 
tel 
Clearly, c € AN B means that 2 is in both of the two sets A and B. For finitely 
many sets, say A,, Ao,... An, we shall say that x is in the intersection 


AyN AgN-++N An 


when zx is in every one of the sets A,, Ao,... An; that is, xr € A; for every 1 <i<n. 
Using J = {1,2,...,n} as an index set, we write 


() Ai = AlN AQN- +N An 
tel 
and so, x € (| A; means that x € A; for every i € I. 


tel 
Similarly, we can form the union and intersection of any indexed family of sets 
{C; : 7 € I}, where J can be a finite or infinite set. 
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Definition 1.2.8. Let {C; : i € I} be an indexed family of sets. The union L C; is 
tel 
the set of elements x such that x € C; for at least one i € I; that is, 


LJ C; = {x: x € C; for some i € J}. 
tel 


Definition 1.2.9. Let {C; : i € I} be an indexed family of sets. The intersection 


() C; is the set of elements x such that x € C; for every 7 € I; that is, 
tel 


(| GHi{e wed, for every 7 € I}. 
wel 


The next remark can be used to prove set identities that involve indexed unions 
and/or intersections. The phrase “iff” abbreviates the expression “if and only if.” 


Remark. Let {C; : i © J} be an indexed family of sets. The following hold: 


(1) ce UC, iffzeC,forsomei el. (3) € (\C, iff x € C, for everyic I. 


wel tel 
(2) c¢ UC, iffe¢C,foreveryiel. (4) ¢¢ (\C, iffz € C, for someie I. 
i€l ier 


De Morgan’s Laws for Indexed Families of Sets 


Theorem 1.2.10. Let {B;:i€ I} be a family of sets and let A be a set. Then 


(1) AV\UB = N(A\ Bi). 


tel wel 


Proof. We prove (1) and leave (2) as an exercise. For any x, we prove that 


xe A\ (JB; iff re (\(A\ Bi), 


iel iel 
as follows: 
ce A\|JB; iff ce Aand « ¢ | J)B; by the definition of \ 
iel ie€l 
iff ¢ € Aand « ¢ B; for every i € I by the definition of U 
iff « € A\ B; for everyie I by the definition of \ 
iff « € { \(A\ Bi) by the definition of {(). 


wel. 


Therefore, A \\ U Bi = (\(A\ Bi). 


el ier 
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1.2.5 Unindexed Families of Sets 


Indexed families of sets occur frequently in mathematics. Mathematicians also deal 
with families of sets (see Definition 1.2.4) that are not identified as an indexed set. 
When F is a family of sets, the union UF is the set of elements x such that x € C 
for some C' € F; that is, 


JF = {x: x €C for some C € F}. 


The intersection (|F is the set of elements x such that x € C for every C € F; 
that is, 
()F = {x: x €C for every C € F}. 


For example, let F be the family of sets defined by F = {{1,2,9}, {2,9}, {4,9}}. 
Then UF = {1,2,4,9} and (VF = {9}. An “unindexed” version of De Morgan’s 
Theorem 1.2.10 also holds. 


Theorem 1.2.11. Suppose that A is a set and that F is a family of sets. Then 


(1) A\UF =(YA\B: BEF}, 
(2) A\NF =U{A\ B: Be F}. 


Exercises 1.2 


1. Recalling our discussion on interval notation on page 2, evaluate the following 
set operations in terms of an interval: 


(—2, 4) U (—00,2). 


R \ (2,00). 
(e) (R \ (-00, 2]) U (1, 00). 
2. Let J = {2,3,4,5}, and for each i € J, let C; = {1,2 + 1,7 — 1, 2%}. 
(a) For each 7 € J, list the elements of Cj. 
(b) Find (] C; and U CG. 
i€l i€l 
3. For each n € N, let O, be the open interval O,, = (1,1 ++). Then {O, : n € N} 


is an indexed family of sets. Evaluate the sets: () On, and U On. 
nen nen 


A. Let I = {i€ R: 1 <i} =[1, 00) and let A; = {c €R: —} <x < 2-34} for each 
i € I. Express (J A; in interval notation, if possible. Express () A; in interval 
iel iel 
notation, if possible. 


5. Prove Theorem 1.2.10(2). 


6. Prove the following theorems: 
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(a) Theorem. Let {A; : i € J} and {B; : i € I} be indexed families of sets with 
indexed set J. If A; C B; for alli € J, then U A; C U Bj and 1) A; C () Bi. 
i€l i€l i€l il 
(b) Theorem. Let {A; : i € J} and {B; : 7 € J} be indexed families of sets. If 
io € J is such that A,;, C B; for all 7 € J, then (1) A; € (| Bj. 
iel je 
(c) Theorem. Let A be a set and {B; : i € I} be an indexed family of sets. 
i€l i€l 
(d) Theorem. Let A be a set and {B; : i € I} be an indexed family of sets. 
ier iel 
(ec) Theorem. Let A be a set and {B; : i € I} be an indexed family of sets. 
iel i€l 
7. Prove Theorem 1.2.11. 


8. Let {B, : x € R*} be as in Example 1.2.6. Evaluate () B, and U By. 


xERt xzeERt 


9. Let {B; : 7 € I} be as in Example 1.2.7. Evaluate () B; and U Bi. 
ic I il 


1.3 FUNCTIONS 


Another fundamental concept in mathematics is the notion of a function. One can 
formally define this concept using set-theoretic principles; however, as the reader is 
already familiar with functions, we will essentially just review the key features of a 
function. Recall that a function is a way of associating each element of a nonempty 
set X with exactly one element of another set Y. In calculus, functions are typically 
defined in terms of a formula, but in real analysis one needs to view functions in a 
more general setting. 


Definition 1.3.1. We write f: A > B to mean that f is a function from the set 
A to the set B, that is, for every element x € A there is exactly one element f(z) 
in B. The value f(x) is called “f of x,” or “the image of x under f.” The set A is 
the domain of the function f and the set B is the co-domain of the function f. In 
addition, we shall say that x € A is an input for the function f and that f(x) is the 
resulting output. We will also say that x gets mapped to f(z). 


In the above definition, keep in mind that the notation f represents a function 
and the notation f(x) identifies the value of the function at x. In particular, f £ f(z). 


Remark. If f: A — B, then each zx € A is assigned to exactly one element f(a) in 
B.So f is single-valued, that is, for alla € Aand z € A, ifx = z, then f(x) = f(z). 


Definition 1.3.2. The range of a function f: A — B, denoted by ran(f), is the set 
ran(f) = {f(a):a¢ A} ={be B: b= f(a) for some a € A}. 


The range of a function f is the set of all the “output” values produced by /. 
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Question. Let h: X > Y bea function. What does it mean to say that b € ran(h)? 
Answer: b € ran(h) means that b = f(x) for some x € A. 


Example. Let f: R — R be the function in Figure 1.2 defined by the formula 
f(z) =x? —1. Then ran(f) = {f(x) : 2 € R} = {2? -1: 2 € R} = [-1, 00). 


—1 


Figure 1.2: Graph of f(x) = x? — 1. 


1.3.1 Real-Valued Functions 


Real-valued functions are the focus in a calculus course and they will be the focus in 
this text as well. A real-valued function is one that has the form f: D > R, that 
is, the output values of the function f are real numbers. In this book, the domain of a 
real-valued function will typically be a set of real numbers. A polynomial function 
of degree n is a real-valued function of the form 


f(z) = Ant" + ee ol +++ GX + ao, 


where Gy,...,@1,@ 9 are real number constants, n is a natural number, and a, 4 0. 
A rational function is one that is defined as the ratio of polynomials. A constant 
function has the form f(x) = a for all x € D where a is a constant. The zero 
function is the constant function where a = 0. 


Definition 1.3.3. A f: R > R is an even function if f(—x) = f(x) for allz ER. 
If f(—x) = —f (a) for all x € R, then f is an odd function. 


1.3.2 Injections and Surjections 


There are two fundamental properties that a function may possess; namely, a function 
may be injective and/or surjective. 


Definition. A function f: X — Y is an injection (or one-to-one), if distinct 
elements in X get mapped to distinct elements in Y; that is, 


for all a,b € X, ifa #b, then f(a) F f(b), 


or equivalently, 


for all a,b € X, if f(a) = f(b), then a = b. 
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Definition. A function f: X — Y is a surjection (or onto Y), if for each y € Y, 
there is an x € X such that f(x) = y. 


Definition. A function f: X > Y is a bijection, if f is an injection and surjection. 
Concerning the above three definitions, we make three observations: 


e f: X 4 Y isan injection if and only if for each y € Y there is at most one x € X 
such that f(x) = y. An injection is also said to be injective. 


e f: X > Y isa surjection if and only if for each y € Y there is at least one x € X 
such that f(x) = y. A surjection is often referred as being surjective. 


e f: X > Y isa bijection if and only if for each y € Y there is exactly one x € X 
such that f(x) = y. A bijection is also described as being bijective. 


1.3.3 Composition of Functions 


If f: X — Y and g: Y — Z, then for any x € X, we have that f(x) € Y. So g(f(z)) 
is an element in Z (see Figure 1.3). This “composition” g(f()) allows us to assign 
elements in X to elements in Z. Thus, we can define a new function. 


Definition. Given functions f: X + Y and g: Y > Z, one forms the composition 
(go f): X > Z by defining (go f)(x) = g(f(x)) for all x € X. The function go f is 
a composite function. 


gof 
Figure 1.3: f(x) = y, g(y) = 2, and g(f(x)) = z. 


We now present results on the composition of injections and surjections. 
Theorem 1.3.4. If f: X > Y and g: Y —> Z are injections, then (go f): X > Z 
is an injection. 


Proof. Let f: X + Y and g: Y > Z be injections. Let a,b € X. Assume that 
(go f)(a) = (go f)(b). Thus, (x) g(f(a)) = g(f(b)) by the definition of composition. 
Since g is an injection, (x) implies that f(a) = f(b). As f is an injection, we conclude 
that a = b. Therefore, (go f) is an injection. 


Theorem 1.3.5. If f: X > Y and g: Y > Z are surjections, then (go f): X > Z 
is a surjection. 

Proof. Let f: X + Y andg: Y > Z be surjections. Let z € Z. Since g is a surjection, 
there exists a y € Y such that g(y) = z. As y € Y and f is a surjection, there is an 
x € X such that f(x) = y. Thus, (go f)(x) = g(f(x)) = g(y) = z. Hence, (go f) isa 
surjection. 
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1.3.4 Inverse Functions 


A function f: X — Y assigns elements in X to elements in Y. This assignment can 
be partially reversed if f is an injection. 


Theorem 1.3.6. Let f: X — Y be an injection and let R = ran(f). Then there 
is a function f~!: R + X defined as follows: For each y € R, f~'(y) is the unique 
element x in X such that f(x) = y. Thus, for all ye Rand « € X 


f(y) == iff f(z) =y. (1.1) 


Proof. Let f: X — Y be an injection and let R = ran(f). Let y € R. Thus, there is 
an x € X such that f(x) = y. Suppose that x’ € X also satisfies f(x’) = y. Thus, 
f(x) = f(a’). Because f is injective, it follows that x = x’. So for every y € R there 
is exactly one x € X such that f(x) = y. Hence, the formula f(x) = y used in (1.1) 
defines the function f~!: RX. 


Definition 1.3.7. Given an injection f: X > Y, let R = ran(f). Then the function 
f-': R-> X, satisfying (1.1) for all y € R, is the inverse function of f. 


We now show that if a function and its inverse are functionally composed, then 
the result is an identity function 7, that is, i(x) = x for all x in the domain of i. 


Theorem 1.3.8. Let f: X — Y be injective. Let R = ran(f) and let f-': RX 
be the inverse of f. Then (f~!o f): X 4 X and (fo f-'): R= R. Moreover, 


(a) f: X > Ris a bijection, 

(b) (f-lo f)(x) =a, for allz eX, 

(c) (fof )(y) =y, for all ye R. 
Proof. Item (a) should be clear. Let x € X. Since f(x) € Y, let y € Y be such that 
f(x) = y. Theorem 1.3.6 implies that («) f~'(y) = «x. After substituting y = f(x) into 
equation (*), we see that f~!(f(x)) = x. So (b) holds. To prove (c), let y € R. Since 


f-'(y) € X, let x € X be such that f~'(y) = x. Thus, (f) f(x) = y by Theorem 1.3.6. 
Upon substituting x = f—'(y) into equation ({), we obtain f(f~!(y)) = y. 


We end this section by showing that the inverse of a bijection is also a bijection. 


Theorem 1.3.9. Suppose that f: X — Y is a bijection. Let f~': Y + X be the 
inverse of f. Then f~! is a bijection. 


Proof. Let f: X + Y bea bijection. So Y = ran(f). We first prove that f-!: Y + X 
is injective. Let y,y/ € Y. Assume f~!(y) = f~1(y’). Let « € X be this common value. 
Thus, f(y) =a and f-'(y/') =z. So f(z) = y and f(z) =y/, by (1.1). Since f is a 
function, we conclude that y = y’. Hence, f~! is an injection. 

To prove that f~': Y — X is surjective, let x € X. So let y € Y be such that 
f(x) = y. By (1.1), f-'(y) = a. Therefore, f~! is a surjection. 


Thus, if f: X > Y is injective, then f: X > Rand f~!: R— X are bijections, 
where R = ran(f). 
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1.3.5 Functions Acting on Sets 


Often in real analysis, one is more interested in what a function does to an entire 
subset of its domain, rather than how the function affects each individual element in 
its domain. This brings us to the following definition. 


Definition 1.3.10 (Image of a Set). Let f: X — Y be a function and let S C X. 
The set f[S], called the image of S, is defined by 


f[S]={f(a): ce Ss={yeY:y= f(z) for some z € S}. 
(Note that f[X] = ran(f).) 


Figure 1.4 illustrates Definition 1.3.10. The square S' represents a subset of the 
domain of the function f. The image f|.S] is represented by a rectangle. 


Figure 1.4: Starting with S C X, we obtain the new set f[S] CY. 


The functional image of a set can lead to a better understanding of the function 
itself and can reveal some properties concerning its domain and range. 


Remark. In some mathematics textbooks, f|[.S] is written as f(S); however, this 
latter notation is ambiguous and its meaning must be inferred from the context. The 
conventional functional notation f(z) means that x is an element of the domain of 
f and it does not mean that x is a subset of the domain. Here, in this book, we will 
avoid such vague notation by denoting the image of set S by f[SI. 


Example 1.3.11. Let f: R — R be defined by f(x) = |z|. Let S = {—4, —3, 2, 3}. 
Then the image of S is f[S] ={f(z): 2 € S}={|z|:2 eS} = {2,3, 4}. 


Given a subset S of the domain of a function f, Definition 1.3.10 identifies a 
subset f[S] of the co-domain of f. This process can be inverted. We can start with a 
subset T of the co-domain and then use it to define a subset of the domain. 


Definition 1.3.12 (Inverse Image of a Set). Let f: X — Y be a function and let 
T CY. The set f~'[7] is the subset of X defined by f-![T] = {x € X: f(x) € Th. 
The set f—'[T] is the inverse image of T. 


A depiction of Definition 1.3.12 is given in Figure 1.5. The circle T depicts a 
subset of the co-domain of f. The inverse image f~![T] is represented by an ellipse. 


Example 1.3.13. Let f: R — R be defined by f(x) = |z| and let T = {-8, 2,3}. 
The inverse image of T is f~'[T] ={x ER: f(x) € T} = {-3, —2, 2, 3}. 
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Figure 1.5: Starting with T C Y, we obtain the new set f~'[T] C X. 


The notation f~! used in Definition 1.3.12, should not be confused with that of 
an inverse function. Theorem 1.3.6 implies that the inverse function exists if and only 
if the original function is an injection. Definition 1.3.12 applies to all functions, even 
those that are not injective. Given a subset of the co-domain of any function f, the 
inverse image of this subset defines a subset of the function’s domain. 

On the other hand, suppose that f: X — Y is an injection and R= ran(f) CY. 
Then the inverse function f~': R > X exists. Now let U C R. In this case, there are 
two ways that the notation f~'[U] can be interpreted: 


1. f-'[U] ={f-+(y): y € U}, the image of U under the inverse function f~'; and 
2. f-'[U] ={xe X: f(x) € U}, the inverse image of U under the function f. 


However, using (1.1), one can show that the two sets in 1 and 2 (on the right side of 
the equality) are equal. So, if the inverse function exists, both interpretations yield 
the same set, and thus, one can use either interpretation without ambiguity. 

The following remark states three observations that can be very useful when 
working with the image, or the inverse image, of a set. 


Remark 1.3.14. Let f: X ~Y,SCX,TCY,aeX anddveY. 


1. IfaeS, then f(a) € f[S]. 

2. be f|[S] if and only if b= f(x) for some x € S. 

3. a€ f(T] if and only if f(a) € T. 

Remark (Image Warning). If f(a) € f[S], then we can conclude that f(a) = f(x) for 


some x € S, by item 2 of Remark 1.3.14; however, we cannot necessarily conclude 
that a € S. In Example 1.3.11, we have that f(4) € f[S] and yet 4¢ 8S. 


Given a function f: X — Y, we now identify four relationships that hold for the 
image and inverse image of subsets of X and Y, respectively. 


Theorem 1.3.15. Let f: X — Y be a function. Let C and D be subsets of X, and 
let U and V be subsets of Y. Then 


(a) fIC ND] ¢ f[C]N f(D] (c) 
(b) fICUD] = f[C] U FID] (d) 
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Proof. Let f: X — Y be a function. We only prove (a) and (d). Let C and D be 
subsets of X and let U and V be subsets of Y. 


(a). We prove f[(CN D] C fiC]N f[D]. Let y € f[CN DI. Since y € f[C nN DI, 
there is an x € CMD such that y = f(x). Because x € C/N D, we see that x € C 
and x € D. Therefore, y = f(x) € f[C] and y = f(x) € f[D]. Thus, y € f[C]N f[D]. 


(d). We prove f~'[U UV] = f7'[U] U f-1[V]. Let x be arbitrary. We shall prove 
that x € f-'[U UV] if and only if z € f-'[U] U f—![V], as follows: 


ce f [UUV] iff f(z) eUUV by definition of inverse image 
iff f(x) € U or f(x) EV by definition of U 
if x € f-*[U] or x € f~'[V] _ by definition of inverse image 
if ce f (Nuff "|v by definition of U. 


Therefore, f-'[U UV] = f-'[U] U f-[V]. 


When a function is injective, then the subset relationship in Theorem 1.3.15(a) 
can be replaced with equality. 


Theorem 1.3.16. Let f: X — Y be a function. Let C' and D be subsets of X. If f 
is an injection, then f[CN D]| = f[C] 9 f[D]. 


Proof. Let f: X — Y be injective and C, D be subsets of X. By Theorem 1.3.15(a), 
f[IC ND C f[C] A f[D]. To prove that f[C]N f[D] C f[Cn DI, let y € f[C] f[D]. 
Thus, y € f[C] and y € f[D]. Because y € f[C], there is an x; € C such that 
f(r1) = y. Also, since y € f[D], there is an x2 € D such that f(r2) = y. Hence, 
y = f(a) = f(x2). Since f is injective, we infer that x; = x2. Thus, x, € D. So, 
x1 € CND and therefore, y = f(x1) € f[C MD]. Hence, f[CN D] = fi[C]N FD]. 


Remark 1.3.17. If f: D — R and [a,b] C D, then we denote the image f |[a, bj] by 
f([a,5]), as an aid to readability. 


Exercises 1.3 


1. Using Definitions 1.3.10 and 1.3.12, explain why items 1-3 of Remark 1.3.14 hold. 


2. Let f: X > Y be a function. Let S be a subset of X, and let T be a subset of 
Y. Prove that f[S] C T if and only if for all « € S we have f(x) € T. 


3. Prove item (b) of Theorem 1.3.15. 
4. Prove item (c) of Theorem 1.3.15. 


5. Given a,b € R with a > 0, define the function f: R > R by f(x) = ax + b. Let 
U = [2,3]. Evaluate f[U] and f~'[U] in terms of intervals. 


6. Let f: X > Y be a function and let A C X and B C X. Prove that if A C B, 
then f[A] C f(B] 
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ao: 


10. 


11. 


12. 


*13. 


14. 
15. 


16. 
17. 


18. 
19. 
20. 


21. 
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. Let f: R > R be the function defined in Example 1.3.11. Find AC Rand BCR 


such that f[A] C f[B] and A Z B. 


. Suppose f: X — Y is an injection and let A C X and B C X. Prove that. if 


f[A] C f[B], then A C B. 


Let f: X > Y bea function and let C C Y and D CY. Prove that if C C D, 
then f-1[C] C f-1[D}. 


Let f: R — R be the function defined in Example 1.3.13. Find CC R and 
D CR such that f~'[C] C f-'[D] and C ¢ D. 


Let f: X 4 Y be onto Y and let C CY and D CY. Prove if f~*[C] C f71[D], 
then C C D. 


Define the function f: R > R by f(z) = x? and let U = [—1, 4]. Show that 
(a) f[f*[U]] 4 U, (c) (FUT) A FTA. 
(b) f-*[f[U]] 4 U, 


Let f: X > Y bea function and let A C X and C CY. Prove that f[A] C C if 
and only if A C f~"[C]. 


Let f: X > Y be a function. Let A be a subset of X. Prove that A C f~![f[A]]. 


Suppose f: X — Y is an injection. Let A C X and x € X. Prove if f(x) € f[A], 
then x € A. 


Suppose that f: X > Y is injective. Let A C X. Prove that A = f~'[f[A]]. 


Let f: X + Y. Suppose A = f~'[f[A]] for all finite subsets A of X. Prove f is 
injective. 


Let f: X > Y be a function. Let C be a subset of Y. Prove that f[f~1[C]] C C. 
Assume that f: X — Y is a surjection. Let C C Y. Prove that f[f~![C]] = C. 


Given a,b € R with a > 0, define the function f: R > R by f(x) = ax+0. Using 
Exercises 16 and 19, prove that f[f~'[U]] = f~[f[U]] for every U CR. 


Let f: X > Y bea function. Let {Vj : 7 € I} be an indexed family of sets where 


tel ie. 


V, CY for all i € I. Prove that my Vii = Wa 


Exercise Notes: Review Remark 1.3.14. Exercise 7 verifies that the converse of 
Exercise 6 is not true for all functions. Exercise 8, however, shows that this converse 
is true for all injective functions. Similarly, Exercise 10 shows that the converse of 
Exercise 9 is not true for all functions. Exercise 11 then shows that this converse is 
true for all surjective functions. 
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1.4 MATHEMATICAL INDUCTION 


Mathematical induction is a potent method of proof that is often applied to establish 
that certain statements are true of all the natural numbers. Before presenting this 
method of proof, we first recall the well-ordering principle. 


1.4.1 The Well-Ordering Principle 


Every nonempty subset of N has a least element. This important property is stated 
as a principle that is routinely applied in mathematical proofs. 


Well-Ordering Principle 1.4.1. Every nonempty subset of N has a least element. 


The well-ordering principle implies the next theorem which is typically covered 
in a discrete mathematics course or an introduction to proof course. 


Definition 1.4.2. For a natural number n > 1, we say that n = p{'pS?---pi* is a 
prime factorization of n, when p;,...,p, are distinct prime numbers and ay,..., dx 


are natural numbers. If pj < p; when 1 <i < j <k, then the factorization shall be 
referred to as an ascending prime factorization. 


Theorem 1.4.3 (Fundamental Theorem of Arithmetic). Let n > 1 be a natural 


number. There exist primes py < po <--- < pp and natural numbers ay, dg,..., a, 
such that n = pi'p5?--- pz. Moreover, if n = gig? ee Th is any ascending prime 
factorization, then =k, py = q1, ..-, Pe = qe and a, = by, ..., ax = be. 


The well-ordering principle also implies that proof by mathematical induction is 
a valid technique of proof (see Exercise 18). 


1.4.2 Proof by Mathematical Induction 


Mathematical induction is a powerful method for proving theorems about the natural 
numbers. Suppose we have a statement about every integer greater than or equal 
to the integer 6b. How does one prove this statement by mathematical induction? 
First prove that the statement definitely holds for 6. Then prove that whenever the 
statement holds for an integer n > 6, then it must hold for the next integer n + 1 
as well. We now present a proof strategy that summarizes proof by mathematical 
induction. 


Proof Strategy 1.4.4. Let b be a fixed integer. To prove a statement of the form 
(Vn > b) P(n) by mathematical induction, use the diagram: 


Base step: Prove P(b). 
Inductive step: Let n > b be arbitrary. 
Assume P(n + 1). 
Prove P(n). 


In a proof by Mathematical Induction, the proof of P(b) is called the base step and 
the proof of (Vn > b)(P(n) > P(n+1)) is called the inductive step. The assumption 
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P(n) is the induction hypothesis and the statement P(n + 1) is the induction 
conclusion. A proof which uses the Principle of Mathematical Induction is called an 
induction proof or proof by induction. 

In the base step of an induction proof, one must show that the statement P(b) is 
true. To do so, simply replace n by b everywhere in P(n) and verify that P(b) holds. 

The inductive step is more challenging as it requires one to show that P(n + 1) 
follows from the assumption P(n). This usually involves expressing the statement 
P(n + 1) in terms that relate to the assumption P(n). Then one can apply the 
assumption P(n), which is referred to as using the induction hypothesis. After 
establishing that P(n + 1) holds, the proof is complete. 

We will prove our next two theorems by mathematical induction. In the proof 
of the following lemma and theorem, we will implicitly apply the Sum and Product 
Principles of Inequality 1.1.8. 


Lemma 1.4.5. If n > 4, then n? > 2n +1. 


Proof. Assume that (x) n > 4. Since n > 0, we conclude that n? > 4n = 2n + 2n. 
From (x), we also see that 2n > 8 > 1. Therefore, n? > 2n + 1. 


Theorem 1.4.6. For every natural number n > 5, 2" > n?. 


Proof. We prove, by mathematical induction, that 2” > n? for all n > 5. 
Base step: For n = 5, we see that 2” = 32 and n? = 25. Thus, 2° > 5?. 


Inductive step: Let n > 5 and assume the induction hypothesis that (IH) 2” > n?. 
We show that 2"+1 > (n+ 1)’. Note that 2"+! = 2.2" =2" +49". Hence 


ney hae by algebra 

5 by the induction hypothesis (IH) 
>n?+22n+1 n?>2n+1 by Lemma 1.4.5 
=(n+1) by factoring. 


>nin 


Hence, 2"+! > (n+ 1)? and the proof is complete. 


Let m <n be integers and let h: {k €Z:m<k<n}—Rbea function. Recall 
the summation notation, or U notation, >> h(i) = h(m) + h(m+ 1) +--- + A(n). 
k= 


=m 
Our next theorem will be applied later in the text. 


Theorem 1.4.7 (Geometric Summation). Let r 4 1 be a fixed real number. For 
rrtl_y 


n 
every integer n > 0, we have > r* = a 


k=0 


n 
Proof. Let r 4 1. We prove that Po (eS a for all n > 0, by induction. 
0+1 


0 —1| 0 
Base step: For n= 0, > k—/,0 — 1] and la 1. Thus, > ae 
k=0 (a k=0 


pOtl_y 
“pl 
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Inductive step: Let n > 0 and assume the induction hypothesis that 


n n+1 
r al 
Sorta at (1H) 
k=0 = 
n+1 a n+1 A 
We show that >> r* = el that is, we show that S> r* = net as follows: 
k=0 os k=0 ’ 
n+1 n 
2 re = (> “) +r”! by property of © notation 
k=0 k=0 
prt —] 
eg es eel r"t! by induction hyp. (IH) 
r— 
n+2 1 
= —— by algebra. 
r—l 
n+l pnt2_4 . 
Hence, re =, and the proof is complete. 


The shift rule, below, allows one to change the lower limit of a sum without 
changing the value of the sum. This rule will be applied several times in the book. 


The Shift Rule 1.4.8. Consider the sum > h(k) with lower limit m. To rewrite 
k=m 
this sum as one with the lower limit s, compute d = s — m and then 


n n+d n+d 
S>h(k) = S> h(k—d) = = h(k — d). 
k=m k=m+d k=s 


Example. Let us shift the sum ae s to an equal sum with lower limit 1. We obtain 

d= 1-3 = —2. We now shift, the upper and lower limit values down by 2 and 
n—2 

modify the summand by shifting k up by 2, to obtain es B= » oe 


In mathematics, the factorial of a natural number n is the product of all the 
natural numbers less than or equal to n. This is written as n! and is called “n factorial.” 


Definition 1.4.9. For each natural number n, the value n! is defined to be 
n! = n(n —1)(n—2)---2-1. 
By convention, we define 0! = 1 (which is used to simplify mathematical formulas). 


Example. Note the following: 


1, =a — 1) fora, 21, 
2. (n+1)! = (n+ 1)n! for n > 0. 
(n+1)! — (n+1)(n)(n— 1)! 
3 (n— 1) = in — 1)! =n(n+1), forn > 1. 


26 m@ Real Analysis: With Proof Strategies 


We now introduce the binomial coefficient Gs an important tool. 


Definition 1.4.10. Let n > k > 0 be integers. Then (7) = Coie 


Remark. Since 0! = 1, we obtain (3) = 1 and (”) = 1, for all integers n > 0. 
Moreover, when n > k > 1, we see that 


(i> nl n(n —1)---(n—(k—1)) 


k} E(n—kl kl 
The binomial coefficient (2) is used frequently in many areas of mathematics. In 
combinatorics, the notation |) is usually read as “n choose k” because there are (2) 
ways to choose an subset of & elements from a set of n elements. 

The following famous theorem identifies the resulting algebraic expansion of the 


expression (x + y)” when n is a natural number, where we let a° = 1 for alla € R, 


Theorem 1.4.11 (Binomial Theorem). Let x and y be real numbers. Then for every 


integer n> 1, (e+ y)"= 3 (par hyk =a" + (Maly +o + (Pay ty" 


From the binomial theorem, we can derive a useful inequality. 
Corollary 1.4.12. Let y > 0. Then (1+y)" >1+4+ n(n) y?, for every integer n > 2. 


Proof. By Theorem 1.4.11, 


when y > 0 and n > 2. 


We end this section with a general description of an inductively defined sequence. 
The first term of such a sequence is identified and thereafter, each successive term is 
defined in terms of the previous term. 


Inductively Defined Sequences. Let M[z] be a “method” or function to obtain a 
number m from another number x, denoted by m = M{za]. Let b be a given number 
and let 7 be a fixed integer. We define an infinite sequence aj, 41, @j42,-..,@n,--- 
of numbers, starting at 7, by induction as follows: 


(2) an = Ma, | tor allay > 4: 


Exercises 1.4 


1. Let 1 <a. Prove that for every natural number n > 1, a < a”. 
*2. Prove that for every natural number n > 1, 2" > n. 


*3. Let a,b € R be positive where a < b. Prove that a” < b”, for every n € N. 


Us 


*9. 


10. 
11. 
12. 
13. 


14. 


15. 


16. 


17. 


*18 


19 
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. Prove that 2” < n! for all n > 4. 
*5. 


(Bernoulli’s inequality) Let x > —1 be a real number. Prove that (1+ 2)” > 
1 + nz for all natural numbers n > 1. 


. Let a1, a2,... be an infinite sequence of real numbers where a, > 0 for all n > 1. 


Prove that (1 + a1)(1+a2)---(1+a@n) >1+a1+a2+--++4a, for alln > 1. 
Let a,b be positive real numbers. Let n > 1 be a natural number. 


(a) Using Exercise 3, prove that a < b if and only if a” < 6". 
(b) Prove that a <b if and only if am <b. 


. Let c < d be real numbers. Let a1, a2,... is a sequence of numbers such that 


¢ <a, < d for all n > 1. Prove that c < *t02tten < q for all n > 1. 


(Telescoping Sum) Let ag,a1,a@2,... be an infinite sequence of real numbers. 
Prove for all natural numbers n > 1, that 


n 
Sia — a;-1) = An — ag. 


i=1 


Prove that for every natural number n> 1,1 + 3+5+4+---+ (2n—1) =n?. 


Prove that for every natural number n > 0, 1+2+2?+4+---4 PQ aot 1, 
Prove that for every natural number n > 1,2+6+18+---+2-3"-! = 3" —1. 
Prove that for every natural number n > 2, (1 — 4) (1-9) --- (1-4) = S. 

n 
Prove that for every natural number n > 1, > k? = Reet YGnt) 

k=1 
n 
1 _ 

Prove that for every natural number n > 1, >) gay = aa 


[Hint: 4(n + 1)? —1 = (2n +1)(2n +3).] 
Prove that for every natural number n > 1, > k-k! =(n+1)!-1. 


Prove that for every natural number n > 1, 


n 


a eee 


k=1 


Let P(n) be a statement that is defined for all integers n > 1. Suppose that 
(a) P(1) is true, and 

(b) for all integers n > 1, if P(n) holds, then P(n + 1) also holds. 

Using the Well-Ordering Principle 1.4.1, prove that P(n) holds for all n € N. 


Let 1 <k <n be integers. Prove that 


("e') )*()) 
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20. Let 1 <k <n be integers. Justify the following sequence of equalities: 


n A, - n re oe 
a (i) ay: (ie ky 
k=0 k=0 


n n—-1 
nm n 
=1 k=0 
n n-1 
— ont+l1 n n—k,k n n—k,k n+l 
=f + oy i: coy | + y>_ ie cy | +y 
k= 


21. Using Exercise 20 in the inductive step, prove Theorem 1.4.11 by mathematical 
induction. 


Exercise Notes: For Exercise 7, no induction is needed. For Exercise 8, multiply 
the induction hypothesis inequality by n and use the assumption c < a,41 < d. For 
Exercise 18, using proof by contradiction, let N > 1 be the least natural number such 
that P(N) is false. Explain why N = +1 for some n > 1 and observe that n < N. 


CHAPTER O 


The Real Numbers 


2.1. INTRODUCTION 


The differential and integral calculus was developed in the 17th century by Gottfried 
Leibniz and Isaac Newton in terms of quantities, called infinitesimals, which were 
infinitely small nonzero values. For example, the derivative was originally defined to 
be a ratio of two infinitesimals. In his calculations, Newton used a number o which, 
being infinitely small, could be multiplied by any real number and still be negligible. 
But it was necessary to divide by o, and so it had to be nonzero. Leibniz’s number 
dx was positive and less than any assignable quantity, yet it was nonzero. Thus, the 
early development of the calculus was based on a number system that was not well 
understood (at the time) by mathematicians, including Newton and Leibniz. 

In the 18th century, the development of the calculus using infinitesimals was 
attacked by the philosopher Bishop Berkeley. He declared that the calculus involved 
a logical fallacy: infinitesimals were treated as though they were equal to 0 and not 
equal to 0. Bishop Berkeley pointed out that the usual way to calculate derivatives, 
requires one to first assume a non-zero infinitesimal and then after a division by this 
infinitesimal, one gets the result by setting it to 0. So two assumptions in direct 
contradiction were being used. Bishop Berkeley comments: 


And what are these same evanescent increments? They are neither finite 
quantities, nor quantities infinitely small, nor yet nothing. May we not call 
them ghosts of departed quantities? 


Berkeley’s criticisms were well-founded and, as a result, mathematicians were 
motivated to abandon infinitesimals and to pursue a logically correct development of 
the calculus. It should be noted that Euler discovered many important theorems that 
are taught in calculus courses. Euler established his results by using infinitesimals in 
a liberal and free-swinging manner. As long as the infinitesimal method led to correct 
results, Euler (and others) argued that the use of infinitesimals must be fundamentally 
sound. Others, however, were much more skeptical and insisted that the infinitesimal 
concept, which produces contradictions, should not be used to validate the calculus. 

Mathematicians eventually discovered a more sophisticated development of the 
calculus, one that avoids the use of infinitesimals. In the 19th century, the foundations 


29° 
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of calculus were eventually formulated axiomatically in terms of limits rather than 
infinitesimals and thus, the debate subsided. Cauchy in the early 19th century and 
Weierstrass, Dedekind, and Riemann in the late 19th century presented this rigorous 
development of the calculus. It is their ideas and proofs that we will cover here, in 
this real analysis text. We will also examine the real numbers, sets of real numbers, 
and real-valued functions that are defined on sets of real numbers. Moreover, we 
will provide a theoretical foundation that validates the fundamental principles of the 
calculus. 

We now begin this validation by identifying the intrinsic properties that charac- 
terize the real number system. In particular, we will focus on the fact that this system 
satisfies the ordered field axioms and the completeness axiom. Using these axioms 
and the fundamental topics discussed in Chapter 1, we can correctly derive all of the 
theorems of the calculus. 


2.2 RIS AN ORDERED FIELD 


Among the first topics covered in a typical real analysis course are the ordered field 
axioms. These axioms form a basis for the algebraic operations and order properties 
upon which the calculus is based. At this point, the real number system (R,+,-,<) 
should be quite familiar. Moreover, the algebraic operations of this number system 
are typically used correctly by most students of mathematics. However, in formal 
mathematics, it is important to identify the essential algebraic properties of the real 
number system that can be used to derive all of its other algebraic properties. These 
essential properties are called the field axioms. Of course, we will not derive all of 
the familiar algebraic properties from the field axioms, but we will demonstrate how 
this can be done with a few examples. 


Field Axioms. The two binary operations + (addition) and - (multiplication) on 
the set of real numbers R satisfy the following axioms (properties): 


Al. Forallz,yE€R,rt+y=yt+s. 
A2. For all z,y,z € R, r+ (y+z) =(a4+y) 42. 
A3. There is a unique number 0 such that for all x € R, r+0=-2. 
A4. For all x € R, there exists a unique y € R such that x + y = 0. (y = —2) 
M1. Forallz,yeER, x-y=y-e. 
M2. For all z,y,z ER, x: (y-z) = (x-y)-z. 
M3. There is a unique number 1 such that 1 4 0 and for alla €R,x-l=vz. 
M4. For all nonzero x € R, there is a unique y € R such that x-y =1. (y= 27!) 
D1. For all z,y,z ER, -(y+z) =(z-y)+(a- 2). 
Axioms Al and M1 are the familiar commutative laws; Axioms A2 and M2 are 
the associate laws; and Axioms A3 and M8 assert that additive and multiplicative 
identity elements exist. The existence of additive and multiplicative inverse elements 


is declared, respectively, by Axioms A4 and M4. Finally, Axiom D1 is called the 
distributive law. 
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Again, the field axioms imply all of the well-known algebraic relationships that 
hold for the real numbers. For example, note that A3 implies that 0 + 0 = 0; and 
M3 implies that 1-1 = 1. Moreover, by A4, —0 is the unique real number such that 
0+ (—0) =0. Since 0+ 0 = 0, we conclude that —0 = 0. 

The proof of our next theorem demonstrates how these axioms can be used to 
derive other familiar properties. As this proof is presented only to illustrate the 
deductive strength of the field axioms, it can be bypassed without loss of continuity. 


Theorem 2.2.1. For all x,y,z € R, the following items hold: 


(a) Ifa+z2=yt+z, then z=y. (d) «-y =O if and only if x =0 or y=0. 
(b). a0 = 0. (e) (—x)y = —xy. 
(c) (-1)-4#=—a. (f) (—#)(-y) = ay. 

Proof. Let x,y,z € R. We now prove items (a)—(d), and leave (e) and (f) as exercises. 


(a) Assume that (x) e+ z2=y+2. By A4, let —z be such that (A) z+ (—z) = 0. 
Then 


(z+ z) + (-z) =(y+2)+(-2) by &) 
x+(z+(-z))=y+(z+(-2)) by A2 
zr+0=y+0 by (a) 

T=y by A4. 


(b) By A383, we have that 0+ 0 = 0. Thus, 


x-(0+0)=2-0 by (x) 
x:-0+2:-0=2-0 by D1 
z-O0+2-0=2-0+4+0 by A3 

x:0=0 by (a). 


(c) We show that x + (—1)-2 =0. Axiom A4 will then imply that (—1)-2 = —z. 


x+(-1)-2@=2+2-(-1) by M1 
=az-1+a-(-1) by M3 
=a-(14+(-1)) byD1 
=2z-0 by A4 
=0 by (b). 


(d) If x =0, then x- y = 0 by (b). If y = 0, then x- y =0 by M1 and (b). Assume 
that «-y = 0. Then Exercise 1 shows that either « = 0 or y = 0. 


In addition to the field axioms, the set of real numbers also satisfies four order 
axioms from which one can derive, together with the field axioms, all of the ordering 
properties that hold in the real number system. 
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Order Axioms. The ordering relation < (less than) on the set of real numbers R 
satisfies the following axioms: 

Ol. For all x,y € R, exactly one of the following holds: x < y, y < x, or x = y. 

O2. For all z,y,z € R, ifx < yand y < z, then x < z. 

O83. For all z,y,z © R, ifa <y,thena+2<y+z. 

O4. For all z,y,z € R, ifa <yandz>0, then x-z<y-z. 


Axiom O1 is called the trichotomy law, and Axiom O2 is said to be the transitive 
law. Because the real number system (R,+,-,<) satisfies the field and order axioms, 
it is said to be an ordered field. 

In the next theorem, we show that two key properties of inequality are derivable 
from the field and order axioms. Again, a proof is given only to support the contention 
that these axioms imply all the familiar properties of the real numbers. It too can be 
bypassed. 


Theorem 2.2.2. For all x,y,z € R, the following items hold: 

(1) « < yif and only if —y < —2. (2) Ifa < yand z <0, thenz-z > y-z. 
Proof. Let x,y,z € R. We now prove items (1) and (2). 

(1) Assume that (W) x < y. We show that —y < —2, as follows: 


x) + (—y)) < yt ((-2) + (-y 


(=a) + ) by (¥) and 03 
(=a) (=) yaa 
+ (—z) 


+ ( ) 
ab )) by Al 


é t)) + (-y) < (y+ (-y)) + (-a) by A2 
0+ (-y) <0+ (—2) by A3 
—y<-2x by Al and A3. 


The converse follows similarly. 
(2) Assume x < y and z < 0. Then 0 < —z by (1) and the identity —0 = 0. So, 


a<y by assumption 
g-(—z) <y-(-2) by 04 
x: ((—l)z) < y-((—1L)z) by Theorem 2.2.1(c) 
\(a-z) <(-1)(y-z) by M2 and M2 
£+z)<—(y-2z) by Theorem 2.2.1(c) 
Yr 2K U-z by (1). 


Recall the standard notation connected with the ordering relation < (less than): 
x >y means that y < x; x < y means that either « < y or x = y; and x > y means 
that either x > y or x = y. 

This ends our discussion of the field and order axioms. In the remaining body of 
the text, we will not directly cite any of the above theorems and we will assume that 
the reader is familiar with the properties of equality and inequality discussed in this 
section and in Section 1.1.3. However, before moving on to the next topic, we need 
to identify one unfamiliar property of inequality that is often applied in real analysis. 
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Theorem 2.2.3. Let x,y € R. Suppose x < y+e for all e > 0. Then x < y. 


Proof. Let x,y € R and assume that x < y+ e for all ¢ > 0. We will prove that 
x < y. Suppose, for a contradiction, that 2 > y. Thus, x — y > 0 and e = *5*% > 0. 
So, x <y+e by our assumption. Since e = *;4% < x — y, we obtain 


t—Y 


csyte=v+( 5 


) <v+(e-y)=2. (2.1) 


Therefore, from (2.1) we conclude that x < x, a contradiction. 


Corollary 2.2.4. If 0 <2 and x <é for all e > 0, then x = 0. 


2.2.1. The Absolute Value Function 


A very important function on the real numbers is the absolute value function. Many 
of the proofs in real analysis apply this function and its properties. In this section, we 
will identify and confirm these properties. The definition of the absolute value was 
given in Section 1.1.4, but we now repeat it here for clarity. 


Definition 2.2.5 (Absolute Value). Given a real number xz, the absolute value of 
x, denoted by |x|, is defined by 


ZT. if x > 0; 
|z| = 
—xz, ife<0. 


For any real number «x, the number |z| can be viewed as the distance, on the real 
line, from x to the point 0. Our next theorem identifies the basic properties of the 
absolute value function that will be regularly applied throughout the text. 


Theorem 2.2.6 (Basic Properties of Absolute Value). For all a,x € R, where a > 0, 
the following hold: 


(a) 0<|2|,¢<|el, —@<|el, |-al=lel. (4) | = [olla 
(bo) jel =O 4f and onlyat = 0, fe) ja =a. 
(c) |z| < aif and only if -a<a2<a. (f) |x + y| < |x| + yl. 


Proof. Items (a)-(e) follow directly from Definition 2.2.5. We shall now prove (f) as 
follows: Since |a + y|? = («+ y)? by (e), we conclude from (a), (d), and (e) that 


ja + yl)? =a? + 2Qay ty? <a? + [2ay| + y? = |x|? +2 |x| [yl + |yl? = (l2] + |yl)?. 


Thus, |x + y|? < (|x| + |y|)?. Theorem 1.1.11 implies that |x + y| < |x| + |y]. 


Item (f) of Theorem 2.2.6 is called the triangle inequality. If we view real 
numbers as points on the real line, then |a — }| is the distance between the points a 
and b. Let c be another point. From the triangle inequality, we can deduce that 


ja — b| = |a—c+c—)| < |a—cl4+|c—9]. (2.2) 
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Thus, |a — b| < ja — c] + |c — a| holds for any real numbers a, b,c. As we will see, the 
derivation in (2.2) is often applied in real analysis. Here are a few more properties 
that are regularly employed. 


Theorem 2.2.7 (More Properties of Absolute Value). For all x,y,k € R, where 
k > 0, we have 


(i) |z| < k if and only if —k<a<k. (iv) |y| — |z| < |z — yl. 
(ii) |x| > & if and only ifa<—-kora>k.  (v) ||2|-[yl| <|e-—y}. 
(iii) |e] — |y| < |2 — yl. 


Proof. Items (i)-(ii) follow easily from Definition 2.2.5. Items (iii) and (iv) imply (v), 
using Definition 2.2.5. We now prove (iii) and (iv). First we prove (iii). Observe that 
|x| = |x — y+ y|. Hence, the triangle inequality implies that 


|z| = |x -—yty| <|z—y|+ ly). 


So, |z| < |x —y| + |y| and thus, |x| — |y| < |x — y|. To prove item (iv), note that 
|x — y| = |y— 2|. Thus, (iii) implies that |y| —|z| < |x — y|. 


Items (iii)-(v) of Theorem 2.2.7 will all be referred to as the backward triangle 
inequality. The triangle inequality and the three versions of the backward triangle 
inequality are important tools that will be used throughout the book. 

Given a finite nonempty set of real numbers A, we let max A, or max(A), denote 
the maximum number in A. Similarly, we define min A, or min(A), to be the minimum 
number in A. For example, max{—1,2,7,3} = and min{—1,2,7,3} = —1. 

The next four results on the absolute value function are applied later in the text. 


Lemma 2.2.8. Let x, a, and b be real numbers. Ifa < x < b, then |z| < max{|a|, |O|}. 


Proof. Assume that a < x < b. First suppose that + > 0. So, as x < b, we conclude 
that b > 0. So, |x| = x and |b| = b. Since x < 6b, we see that |x| < |b| and therefore, 
|x| < max{|a|, ||}. Now suppose that « < 0. Then, because a < x, we see that a < 0. 
So, |z| = —z and |a| = —a. Since a < z, we see that —x < —a and so, |z| < |al. 
Therefore, |a| < max{|a] , ||}. O 


Lemma 2.2.9. Leta<a<banda<y<b. Then |x — y| < |b—- al. 


Proof. Assume (A) a < x <b; anda < y < b. Thus, (4) —b < —y < —a. Adding 
(A) and (@), we obtain —(b-— a) < x—y < (b—a). Thus, |x—y| < |b—a| by 
Theorem 2.2.6(c). 


Corollary 2.2.10. Let a, b, and L be real numbers. 

(1) Ifb >Oanda< L<a+6, then |Z —(a+b)| < |6J. 

(2) Ifb <Oanda+b< L <a, then |Z —(a+b)| < |OJ. 

Proof. To prove (1), let b > 0 and assume that a < L < a+b. Sincea < a+b < a+b, 


Lemma 2.2.9 implies that |Z — (a+ b)| < ja+b—-—a| = |b|. The proof of item (2) is 
left for Exercise 14. 
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Lemma 2.2.11. Let « > 0 and ¢ € R. Suppose that a < b < c. If |Ja—@| < € and 
lc — €| < «, then |b— @| <e. 


Proof. Let ¢ > 0, €€ R, and a < b < c. Assume that |a — ¢| < € and |c—¢| < «. 
Theorem 2.2.7(i) implies that 2—e <a < €+eand (—e<c< +e. Sincea<b<ce, 
it follows that 2—e<b< +e. Thus, |b— | <e. 


Exercises 2.2 


*1. Assume that x-y = 0. To show that either x = 0 or y = 0, assume that x £ 0. 
Thus, x- x2! = 1 by axiom M4. Justify the steps below, as in the proof of 
Theorem 2.2.1, to show that y = 0. 


y=yl 
=l-y 
=(a-a"")-y 
= (ata) -y 
=a -(x-y) 
=z 1.0 
=0 
2. Prove (e) of Theorem 2.2.1. 
3. Prove (f) of Theorem 2.2.1. 
4. Prove Corollary 2.2.4. 
5. Prove (c) of Theorem 2.2.6. 
6. Prove (i) to (iii) of Theorem 2.2.7. 
7. Justify the equalities and inequalities used in the proof of Theorem 2.2.6(f). 
8. Let x and y be real numbers. Show that || — |y| < |¢+ y|. 
9. Let a > 0. Prove that if |e — a] <a, then x > 0. 
10. Suppose that x > 0 and y > 0. Prove that x? + y? < (a +y)?. 
11. Suppose that « < 0 and y < 0. Prove that x? + y? < (x+ y)?. 
12. Suppose that « > 0 and y > 0. Prove that /a Fy < /@+ vy. 
13. Leta<a<banda<y < b. Prove that |x — y| < |b— al. 


*14. Prove Corollary 2.2.10(2). 
15. Let x be between a and b. Let c € R. Show that |x — c| < max{|a — c|, |b— cl}. 
16. Let x and y be real numbers. Show that the following hold: 
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(a) max{2,y}=He+y)+4|o—yl. (b) min{x,y} = Me+y) —4le—9), 
17. Let c € R, cE R, and € > 0. Suppose that |x — c| <«. 
(a) Prove that |x| < e+ |cl. (b) Prove that |c] — € < |z|. 


Exercise Notes: For Exercise 9, use Theorem 2.2.7(i). For Exercise 12, use proof 
by contradiction and Theorem 1.1.10. For Exercise 15, apply Lemma 2.2.8. 


2.3 THE COMPLETENESS AXIOM 


In this section we present the completeness axiom, which is an assertion that implies 
that there are no gaps or holes in the real number line. We shall see that this axiom 
identifies an essential property of the real number system that will allow us to prove 
the most important theorems in the calculus; for example, the Cauchy Convergence 
Criterion, the Intermediate Value Theorem, and the Fundamental Theorem of Cal- 
culus. Before stating the completeness axiom, we need a few preliminary definitions. 


Definition 2.3.1 (Upper Bound and Lower Bound). Let S C R be nonempty. 


e Suppose there is a b € R such that every real number in S' is less than or equal 
to 6, that is, for all x € S, we have x < b. Then we shall say that b is an upper 
bound for S and that S is bounded above. 


e Suppose there is an a € R such that a is less than or equal to every real number 
in S, that is, for all « € S, we have a < x. Then we will say that a is a lower 
bound for S and that S' is bounded below. 


e If S has both a lower bound and an upper bound, then we say that S is bounded. 


Let S = {2 : n € N} (see Figure 2.1). Since 2 < 3 for all n € N, we see that 
the set S is bounded above. Furthermore, because 0 < 2 for all n € N, the set S is 
bounded below. 


sine 
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Figure 2.1: The set {2 : n € N} plotted on the real line. 


We present a few more examples. 


1. Let a,b € R be such that a < b. Then 0 is an upper bound and a is a lower bound 
for each of the sets (a,b), [a, 6), [a,b]. 

. Z* and Q* are not bounded above. 

. —2 is a lower bound for the sets Z* and Q?*. 

s Theset {27a = Q*} is bounded, because 0 < sui <lforallze Qt. 

. The set {x € R: x < 0} is bounded above, but it is not bounded. 


o F WwW bw 


The following result is often applied to show that a set of real numbers is bounded. 
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Theorem 2.3.2. Let S C R be nonempty. Then S is bounded if and only if there is 
an M >0 so that |z| < M for allze S. 


Proof. Let S CR be nonempty. Assume that S is bounded. Thus, there are nonzero 
real numbers a,b such that a < x < 6 for all x € S. Let M = max{|a| ,|b|} > 0. By 
Lemma 2.2.8, |x| < M for all  € S. To prove the converse, assume that |x| < M for 
all € S, where M > 0. Thus, —-M <2 < M for all x € S. So S' is bounded. 


For example, let S = {x € R: |3a2 — 1| < 5}. We will apply Theorem 2.3.2 to show 
that S is bounded. For each x € S we have that |3x — 1| < 5. Thus, by the backward 
triangle inequality, we conclude that |3z|—1 < |3x — 1] < 5. Hence, 3|z|-—1 <5 and 
so, |x| < 2. Therefore, |x| < 2 for all x € S and S is bounded. 

A set S that is not bounded is said to be unbounded. By Theorem 2.3.2, a set 
S is unbounded if and only if for all M > 0 there is an x € S such that |x| > M. 

Let 6 be an upper bound for a nonempty set S. So, x < b for all « € S. It follows 
that any number greater than b is also an upper bound for S. Our next definition 
will allow us to focus on the smallest, or least, upper bound for S. 


Definition 2.3.3 (Least Upper Bound and Greatest Lower Bound). Let S C R be 
nonempty. 


e Suppose that @ is an upper bound for S. If 6 is the least upper bound for S, 
then is said to be the supremum of S' and we write 3 = sup(S). 


e Suppose that a is a lower bound for S. If @ is the greatest lower bound for S, 
then a is called the infimum of S and we write a = inf(S). 


Example. Consider the set S = {2 : n € N} (see Figure 2.1). As we saw earlier, the 
set S is bounded. We note that 0 is the greatest lower bound for S and that 2 is the 
least upper bound for S. Thus, inf(S) = 0 and sup(S) = 2. 


Let S C R be nonempty. The equation 3 = sup(S) means that the following two 
items hold: (a) 6 is an upper bound for S and (b) 6 is the smallest upper bound 
for S. Similarly, the equation a = inf(S) means that (a) a is a lower bound for S and 
(b) a is the largest lower bound for S. The next remark repeats and clarifies these 
observations. This remark will be used to identify two important proof strategies. 


Remark 2.3.4. Let S C R be nonempty and let a,@ € R. Then we have the 
following (see Figure 2.2): 


1. 6 =sup(S), if and only if the following two conditions hold: 


(a) For alla eS, a < B. 
(b) For all real numbers 8, if 6 is an upper bound for S, then 6 < b. 


2. a = inf(S), if and only if the following two conditions hold: 


(a) For allae S,a<z. 
(b) For all real numbers a, if a is a lower bound for S, then a < a. 
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Figure 2.2: Illustrations for parts 1 and 2 of Remark 2.3.4, respectively. 


Before we continue, we need to distinguish between the notions of a maximum 
element and the supremum. We have defined the maximum and minimum elements 
of a nonempty finite set of real numbers (see page 34). Now, let S be an infinite set 
of real numbers. Suppose that there exists an m € S such that « < m for all x € S. 
Thus, m is a maximum element of S. Also observe that m = sup(S). It turns out 
that S has a maximum element if and only if sup(S) € S. Thus, if sup(S) ¢ S, then 
S does not have a maximum element. A similar observation can be made concerning 
the minimum and infimum of S. 


Definition 2.3.5 (Maximum Element and Minimum Element). Let S C R. 


e Let G = sup(S). If 6 € S, then 8 is the maximum element of S, denoted by 
6 = max(S). 


e Let a = inf(S). If a € S, then a is the minimum element of S, denoted by 
a = min(S). 


For example, let S = [2,5]. Because sup(S) = 5 and 5 € S, we observe that 
max(S) = 5. Since inf(S) = 2 and 2 € S, we see that min(.S) = 2. For another 
example, let T = (2,5). Because sup(T) = 5 and 5 ¢ T, we have that max(T) 
is undefined; that is, T does not have a maximum element. Similarly, min(7) is 
undefined. Finally, let S = {2 : n € N} (see Figure 2.1). We see that max(S) = 2 
and min(S’) is undefined. 

We are now ready to present one of the most important properties of the set 
of real numbers, namely, the completeness axiom. This axiom ensures that every 
nonempty set of real numbers that is bounded above has a supremum. As we will see, 
the completeness axiom forms the basis for all the limit concepts that are applied 
throughout the text. 


Completeness Axiom. Every nonempty set of real numbers that is bounded above 
has a least upper bound. 


The completeness axiom just asserts that if a nonempty set S C R is bounded 
above, then there is a real number ( that satisfies the equation 3 = sup(S). Thus, the 
completeness axiom implies that there are no gaps, or “missing points,” in the set of 
real numbers; however, there are number systems that do have gaps. For example, let 
€é€R, R* =R\ {é}, and S C R* be defined by S = {x € R*: x < £}. So sup(S) = ¢ 
and ¢ ¢ R*. Thus, R* has a gap and R* does not satisfy the completeness axiom. 
Moreover, the number system Q also has gaps (see Remark 2.4.7). Thus, Q fails to 
satisfy the completeness axiom as well. 
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Of course, the completeness axiom is very important; but, rather than just accept 
the axiom, can one actually prove it? The answer is yes. The proof requires one to first 
construct the set R of real numbers from the rational numbers. Such a construction 
is based on set theoretic tools that were independently developed by Georg Cantor 
and Richard Dedekind. Using these tools, Cantor and Dedekind were able to prove 
the completeness axiom. However, we will not present such a proof as it would take 
us too far astray. Thus, we will just accept the axiom and pursue its consequences. 


2.3.1 Proofs on the Supremum of a Set 
Remark 2.3.4(1) inspires a strategy for proving equations of the form sup(S) = £. 


Proof Strategy 2.3.6. Given a real number 6 and a nonempty S C R, to prove 
that sup(S) = 6 we may use the two-step proof diagram: 


Step (1): Prove x < @ for allxe S. 
Step (2): Assume 6 is an upper bound for S. 
Prove 2 < 6. 


In other words, to prove that sup(S) = 6 you must first prove that 6 is an 
upper bound for S' and then prove that {§ is the smallest upper for S. Similarly, 
Remark 2.3.4(1) yields a strategy that allows one to take advantage of an assumption 
having the form sup(S) = £. 


Assumption Strategy 2.3.7. Let S C R be nonempty and let 6 € R. Suppose that 
you are assuming that sup(.S) = 3. Then (1) x < @ for all x € S, and (2) whenever 
b is an upper bound for S, you can conclude that 6 < b. 


Our proof of the next lemma employs both proof strategy 2.3.6 and assumption 
strategy 2.3.7. Given a set S C R and a real number k € R, we can form the new set 
of real numbers defined by A = {kx : x € S}. The set A is often denoted by kS. 


Lemma 2.3.8. Let S C R be nonempty and bounded above, and let k > 0. Define the 
set A by A= {kx : x € S}. Then the set A is bounded above and sup(A) = ksup(S). 


Proof Analysis. Let S C R be bounded above. Hence, sup(S) = @ exists by the 
completeness axiom. Let k > 0. We must prove that sup(A) = &3. There are two 
cases to consider: k = 0 and k > 0. When k = 0 the proof follows easily (see proof 
below). Let k > 0. First note that every element in A has the form kx for some x € S. 
So to prove that 3 is an upper bound for A, we must show that kx < k@ for all 
x € S. Appealing to proof strategy 2.3.6, we construct the following proof diagram: 


Assume ( = sup(S). 
Prove kx < kG for alla € S. 
Assume c is an upper bound for A. 
Prove k8 < c. 


Given that 3 = sup(S), we can use assumption strategy 2.3.7. Also note that if c is 
an upper bound for A, then ka < c for all x € S. We now have all of the ingredients 
required to write a correct proof of the lemma. 
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Proof of Lemma 2.3.8. Let S C R be nonempty and bounded above. Thus, by the 
completeness axiom, 3 = sup(S) exists. Thus, « < 6 for all « € S. Let k > 0. We 
note that every element of A = {ka : a € S} has the form kx for some x € S. There 
are two cases to consider. 


CASE 1: Assume k = 0. Then A = {0} and we clearly have sup(A) = ksup(S). 


CASE 2: Assume k > 0. We first prove that k@ is an upper bound for A. Since x < 6 
for all x € S and k > 0, we have that kx < k§ for all « € S. Therefore, kG is an 
upper bound for A. Let c be an upper bound for A. We shall prove that kG < c. Since 
cis an upper bound for A, we have that kx < c for all x € S. As k > 0, we see that 


x < _ for all x ¢ S. Thus, ¢ is an upper bound for S. Since § is the smallest upper 
bound for S, we conclude that 6 < £ and so, k3 < c. Therefore, sup(A) = kf. 


2.3.2 Proofs on the Infimum of a Set 
Remark 2.3.4(2) motivates a strategy for proving equations of the form inf(S) = a. 


Proof Strategy 2.3.9. Given a real number a and a nonempty S C R, to prove 
that inf(S) = a we may use the two-step proof diagram: 


Step (1): Prove a < x for alla eS. 
Step (2): Assume a is a lower bound for S. 
Prove a <a. 


Let S' be a nonempty set of real numbers that is bounded below. The completeness 
axiom makes no explicit assertion that the infimum of S exists; however, using the 
axiom one can prove that this infimum exists. In particular, the proof of the following 
theorem shows that the completeness axiom implies that there is a real number a 
that satisfies the equation a = inf(S). Before reading the proof of this theorem, one 
should read Remark 2.3.4 and proof strategy 2.3.9. 


Theorem 2.3.10. Every nonempty S C R that is bound below, has a greatest lower 
bound. 


Proof. Let S C R be nonempty with lower bound a. Let S* = {—x: x € S}. Then 
every element of S* has the form —a for an x € S. Since a is an lower bound for S, 
we have a < x for all « € S. Thus, —x < —a for all x € S. Hence, —a is an upper 
bound for S*. By the completeness axiom, sup(.S*) = 6 exists. Thus, —x < § for all 
x € S, and G is the smallest such upper bound. 

We now prove that inf(S) = —(. First we prove that — is a lower bound for S. 
As —x < 6 for all x € S, it follows that —@ < x for all x € S. Thus, —( is a lower 
bound for S. To prove that —@ is the largest lower bound for S, let a be a lower 
bound for S. By the argument in the first paragraph, we see that —a is an upper 
bound for the set S*. Since 6 is the least such upper bound for S*, it follows that 
B < —a. Hence, a < —§. Therefore, inf(S) = —{. 


Assumption Strategy 2.3.11. Let S C R be nonempty and let a € R. Suppose 
that you are assuming that inf(S) = a. Then (1) a < x forall xz € S, and (2) whenever 
a is a lower bound for S, you can conclude that a < a. 
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The proof of the next lemma shows how to employ strategies 2.3.9 and 2.3.11. 
Moreover, this proof is analogous to Case 2 in the proof of Lemma 2.3.8; however, 
“the inequalities are reversed.” 


Lemma 2.3.12. Let S C R be nonempty and bounded below, and let k > 0. Let 
A= {kx: x € S}. Then A is bounded below and inf(A) = kinf(S). 


Proof Analysis. Since every element of A = {kx : x € S} has the form kx for an 
x € S, the logical structure of our proof will be as follows: 


Assume a = inf(S). 
Prove ka < kx for all x € S. 
Assume c is an lower bound for A. 
Prove c < ka. 


We now present a correct proof of the lemma. 


Proof. Let S C R be nonempty and bounded below. Theorem 2.3.10 implies that S' 
has a greatest lower bound a. So a = inf(S). Thus, a < « for all x € S. Let k > 0. 
First we prove that ka is a lower bound for A. Since a < x for all x € S and k > 0, 
we have that ka < kx for all x € S. Therefore, ka is a lower bound for A. 

Let c be a lower bound for A. We will prove that c < ka. As c is a lower bound 
for A, we see that c < ka for all x € S. Since k > 0, we have that ¢ < x forallze S. 
Thus, ¢ is a lower bound for S. As a is the largest lower bound for S, we conclude 
that ¢ <a. Hence, c < ka and therefore, inf(A) = ka. 


We can now formally combine the results of Lemmas 2.3.8 and 2.3.12. 


Theorem 2.3.13. Let S C R be nonempty and bounded, and let & € R. Define the 
set kS by kS = {kx : x € S}. Then the set kS is bounded and 


(a) if k > 0, then sup(kS) = ksup(S) and inf(kS) = kinf(S), 
(b) if k < 0, then sup(kS) = kinf(S) and inf(kS) = ksup(S). 


Proof. Lemmas 2.3.8 and 2.3.12 imply (a). The proof of (b) is left for Exercise 9. 


Our attention will now turn to a concept that plays a prominent role in real 
analysis, namely, the concept of a bounded function. 


Definition 2.3.14. A function f: D — R is bounded if the set 
f(D] ={f(z):2€ D} 
is bounded. If S C D and f[S] is bounded, then f is said to be bounded on S. 
If f: D > R is bounded, then 8 = sup(f[D]) and a = inf(f[D]) exist, and 
a < f(x) < 6 for all x € D (see Figure 2.3). 
For example, let D = (0,2) and f: D — R be defined by f(x) = x?. Then f 


is bounded. Furthermore, f[D] = (0,4), sup(f[D]) = 4, and inf(f[D]) = 0. On the 
other hand, the function g: D + R defined by g(x) = 4 is not bounded. 
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8 = sup(f[D]) > 


<—— f(z) 


f[D] 


graph of f 


Figure 2.3: Illustration of a bounded function. 


Remark 2.3.15. For a function f: D — R, Theorem 2.3.2 and the definition of 
f|D| imply that there are two equivalent ways of saying that f is bounded; namely: 


(1) there are real numbers a,b such that a < f(x) < 6 for all x € D, 
(2) there is a real number M > 0 such that |f(x)| < M for all x € D. 


Our next theorem establishes a supremum and infimum relationship between two 
bounded functions and their sum. This relationship will be used to prove a sum rule 
for integration (see Lemma 6.2.4). 


Theorem 2.3.16. Let f: D— Rand g: D > R be bounded. Define (f+g): DR 
by (f + 9)(x) = f(x) + g(x) for all x € D. Then (f +g): D> R is bounded, and 


(a) sup((f + g)[D]) < sup(f[D]) + sup(g[D]), 
(b) inf(f[D]) + inf(g[D]) < inf((f + g)[D]). 


Proof. Let f: D—> Rand g: D > R be bounded. Let M; > 0 and Mz > 0 be such 
that |f(x)| < Mi and |g(x)| < Mp for all x € D. By the triangle inequality we obtain 


(f+ 9)(x)| = |F(@) + 9(@)| < | F(®)| + |9(@)| < Ma + Mp 


for all x € D. Hence, (f +g) is bounded. We prove (b) and leave (a) for Exercise 17. 
Since f[D], g|D], and (f + g)[D] are bounded, let ¢ = inf(f[D]), 6 = inf(g[D]), and 
7 = inf((f+4g)[D]). Thus, (*) e¢ < f(x) and (**) 6 < g(x), for all x € D. Inequalities 
(*) and («*) imply that ¢+6 < f(x) + g(x) for all e € D. Thus, e+ 6 < (f + g)(z) 
for all a € D. Hence, ¢ +6 is a lower bound for (f + g)|D]. Therefore, ¢+6 < 7, that 
is, inf(f[D]) + inf(g[D]) < inf((f + 9)[D)). 


The inequalities in Theorem 2.3.16 can be strict (see Exercise 12). We end this 
section with two more theorems that involve the supremum and infimum. Viewing 
Figure 2.4 below, the first theorem is very easy to believe and understand. This result 
will allow us to prove a result concerning the supremum and infimum of Riemann 
sums (see Theorem 6.1.17). 


Theorem 2.3.17. Let A C R and B CR be non-empty. Suppose for all x € A and 
all y € B, we have that x < y. Then A is bounded above, B is bounded below, and 
sup(A) < inf(B). 
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Proof. Suppose that 
x<y forallze Aandall ye B. (2.3) 


Let y € B be arbitrary. From (2.3), we have that x < y for all  € A. Thus, 
y is an upper bound for A (see Figure 2.4). The completeness axiom implies that 
7 = sup(A) exists. Since y is an upper bound for A and ¥ is the least such upper 
bound, we conclude that y < y. Because y € B was arbitrarily chosen, it follows that 
y < y for all y € B. Thus, y is a lower bound for B. Theorem 2.3.10 implies that 
6 = inf(B) exists. As y is a lower bound for B and 6 is the greatest such lower bound, 
we see that y < 6. Therefore, sup(A) < inf(B). 


Figure 2.4: An illustration for the proof of Theorem 2.3.17. 


The proof of Theorem 2.3.17 adapts to prove (see Exercise 16) our next result 
which will enable us to prove that a particular type of composite function is integrable 
(see Theorem 6.2.14). 


Theorem 2.3.18. Let A C R be non-empty. Let c € R. Suppose for all x € A and 
all y € A, we have that x — y < c. Then A is bounded and sup(A) — inf(A) < c. 


2.3.3 Alternative Proof Strategies 


The proofs of our earlier results on the supremum and infimum relied on Remark 2.3.4 
in which we observed that the equation @ = sup(.S) means that (a) 6 is an upper 
bound for S and (b) @ is the smallest upper bound for S. Given (a), another way 
to say (b) is that “every number r < § is not an upper bound for S.” A similar 
observation can be made concerning the infimum ofa set. We formally record these 
observations in the following remark. 


Remark 2.3.19. Let S C R be nonempty and let a, 6 € R. Then (see Figure 2.5) 


1. 6 =sup(S), if and only if the following two conditions hold: 


(a) For alla eS, a < £. 
(b) For all real numbers r, if r < 6, then there is an x € S so that r < a. 


2. a = inf(S), if and only if the following two conditions hold: 


(a) For allze S,a<z. 
(b) For all real numbers q, if a < q, then there is an x € S so that x < q. 
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Figure 2.5: Illustrations for parts 1 and 2 of Remark 2.3.19, respectively. 


Remark 2.3.19(1) motivates the following alternative strategies for dealing with 
an equation of the form sup($) = /. 


Proof Strategy 2.3.20. Given a real number 6 and a nonempty S C R, to prove 
that sup(S) = 6 we may use the two-step proof diagram: 


Step (1): Prove x < 6 for allxze S. 
Step (2): Let r < £. 
Prove r < x for some z € S. 


Assumption Strategy 2.3.21. Let S C R be nonempty and let 6 € R. Suppose 
that you are assuming that sup(S) = 8. Then (1) « < 6 for all x € S, and (2) 
whenever r < ( there is an « € S such that r < a. 


We will present two proofs of our next theorem. In the first proof, we shall apply 
the above alternative strategies 2.3.20 and 2.3.21. In our second proof, we apply 
strategies 2.3.6 and 2.3.7. 


Theorem 2.3.22. Let A and B be nonempty subsets of R that are both bounded 
above. Let C= {x +y: x € Aand y € B}. Then C is bounded above and 


sup(C) = sup(A) + sup(B). 


Proof Analysis. Let A and B be bounded above. Thus, a = sup(A) and 6 = sup(B) 
exist by the completeness axiom. So, we must prove that sup(C) = a+ 3. First note 
that every element in C' has the form x + y for some x € A and y € B. So to prove 
that a+ @ is the least upper bound for C’, we must fulfill the following proof diagram: 


Assume a = sup(A) and 8 = sup(B). 
Prover+y<a+6 forallxe Aandy€ B. 
Let r<a+Q8. 

Prove r<a2+y for some x € Aand ye B. 


We now present a correct proof of the theorem. 


First Proof of Theorem 2.3.22. Let A and B be as stated. Thus, let a = sup(A) and 
6 =sup(B). Let z € C. So z=2+y for some x € A and some y € B. As x < a and 
y < B, we see that z = (x+y) < (a+ B). So a+ @ is an upper bound for C. 

To prove that a+ is the least upper bound for C, let r << a+. Thus, r—6 < a. 
Since a = sup(A), there is an x € A such that r — 8 < x. Thus, r—2z < £. Since 
(G = sup(B), there is an y € B such that r—a < y. Hence, r<ax+yandr+yeC. 
Therefore, sup(C) = a+ 8. 
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We will now give another proof of Theorem 2.3.22 using strategies 2.3.6 and 2.3.7. 
Using these strategies, the logical structure of our second proof will be as follows: 


Assume a = sup(A) and 3 = sup(B). 
Prover+y<a+Q forall2é Aandye€ B. 
Let c be an upper bound for C. 

Provea+ 6 <e. 


Second Proof of Theorem 2.3.22. Let A and B be as stated, and let a = sup(A) and 
6 =sup(B). Let z € C. So z= 2+ y for some x € A and some y € B. Since tx < a 
and y < 6, we have that z = (x + y) < (a+ 8). So a+ @ is an upper bound for C. 

To prove that a+ £ is the least upper bound for C, let c be an upper bound for 
C’. We shall prove that a+ 6 < c. Since c is an upper bound for C’, we have that 


e+y<cforalle¢ Aandallye€ B. (2.4) 


Let y € B be arbitrary. By (2.4), we have that + y < c for all e € A. Hence, 
x<c-—y for all x € A. Thus, c— y is an upper bound for A. Since a = sup(A), we 
infer that a < c—y. Since y € B was arbitrarily chosen, it follows that y < c— a for 
all y € B. Therefore, c— a is an upper bound for B. Since 6 = sup(B), we conclude 
that 6 <c—a. Thus, a+ 8 <c. Therefore, sup(C) = a+ 8. 


Remark 2.3.19(2) also provides us with alternative strategies for working with an 
equation of the form inf(S) = a. 


Proof Strategy 2.3.23. Given a real number a and a nonempty S C R, to prove 
that inf(S) = a we may use the two-step proof diagram: 


Step (1): Provea <x forallxeS. 
Step (2): Let a <q. 
Prove « < q for some x € S. 
Assumption Strategy 2.3.24. Let S C R be nonempty and let a € R. Suppose 


that you are assuming that inf(S) = a. Then (1) a < a for all x € S, and (2) 
whenever a < q there is an « € S such that x < q. 


Exercises 2.3 


1. For each of the following subsets S of R, answer the questions: Is the set S 
bounded above? Is the set S bounded below? 


(a) S = [2,5] (b) S = [2,5) 

(c) S = (2,00) (d) S=N 

(e) (eR: (x? +1)-1 > 3} (f) S={4+:neEN} 

(g) S={qEQ:0<q< V2} (h) S={x ER: [2x41] < 5}. 
2. For each subset S of R, identify the sup(S) and inf(S), if they exist. 


(a) S = [2,5] ( 

(c) S = (2,00) (d 

(ec) S={eeER: (2? +1) 1'>5} WS= ne n € N} 

(g) S={qEeQ:0<q< v2} (h) S={x ER: |2e +1] <5}. 


b) S= 2.5) 
)s 
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. For each subset S of R, identify the max(.S) and min(S) if they exist. 


(a) S = (2,5,6} (b) S=N 

(c) S=(2,00)  (d) S={gEQ:0 <a < V3. 
Suppose S C R is nonempty and bounded. Let A C S be nonempty. Prove that 
A is bounded. Then prove that sup(A) < sup(S) and inf(S) < inf(A). 


. Let A C R and B C R be non-empty bounded sets. Suppose that sup(A) = 


inf(B) = @. Prove that x << y foralla € Aandally€ B. 


. Let AC Rand B CR be non-empty sets. Suppose that B is bounded above and 


that for all « € A there is a y € B such that x < y. Prove that A is bounded 
above and sup(A) < sup(B). 


. Suppose S C R is nonempty and bounded. Let 3 = sup(S). Prove that for all 


€ > 0, there exists an x € S such that G-—e <a. 


. Suppose S C R is nonempty and bounded. Let a = inf(S). Prove that for all 


€ > 0, there exists an x € S such that x <a+e. 


. Complete the proof of Lemma 2.3.13, that is, suppose that S C R is bounded 


and nonempty let k < 0. Prove that 
(a) sup(kS) = kinf(S) (b) inf(kS) = ksup(S). 


. Let S* = {a € R: ais a lower bound for S} where S C R is nonempty and 


bounded below. Prove that S* is bounded above and sup(S*) = inf(S). 


Let f: D— R be bounded and k € R. Define (kf): D > R by (kf) (x) = kf (a) 
for all « € D. Prove that (kf) is bounded, (kf)[D] = kf[D], and 


(a) if k > 0, then sup(Af[D]) = ksup(f[D]) and inf(kf[D]) = k inf(f[D]) 

(b) if k <0, then sup(kf|D]) = kinf(f[D]) and inf(kf[D]) = ksup(f[D)). 
Find functions f: D— R and g: D > R such that 

(a) (f+g)[D] 4 f[D]+g[D] where f[D]+g[D] = (f(x) +9(y): # € D& ye D}, 
(b) sup((f + g)[D]) < sup(f[D]) + sup(g[D)). 


Let S C R be nonempty and bounded, and let k € R. Define A = {k+ 2:2 € S}. 
Prove that 


(a) sup(A) = &+sup(S), (b) inf(A) = k + inf(S). 


Let S be a bounded (nonempty) subset of R, k > 0, and c € R. Let A be the set 
A={kx+c:a € S}. Prove that A is bounded and 


(a) sup(A) = ksup(S) +c, (b) inf(A) = kinf(S) +c. 

Let S CR and T CR be nonempty and bounded. Show that SUT is bounded 
and then prove that 

(a) sup(S UT) = max{sup(S), sup(T)}, 

(b) inf(S UT) = min{inf(S), inf(T)}. 

Prove Theorem 2.3.18. 
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*17. Prove part (a) of Theorem 2.3.16. 


18. Let S C R* and T C R™ be nonempty and bounded above. Consider the set 
P={xy:xc€S and y €T}. Prove that sup(P) = sup(S) - sup(Z). 


19. Let f: D— R be bounded. Let E C D be nonempty. Prove that the set f[£] is 
bounded. Then prove that sup(f[E]) < sup(f[D]) and inf(f[D]) < inf(f[£)). 


20. Let f: D — R and g: D > R be bounded. Assume that f(x) < g(x) for all 
x € D. Prove that sup(f[D]) < sup(g[D]) and inf(f[D]) < inf(g[D]). 


21. Let A and B be nonempty subsets of R which are both bounded below. Define 
the set C = {x+y:a€ Aand y € B}. Prove that C is bounded below and that 
inf(C) = inf(A) + inf(B). 


22. Let S C R* and T C Rt be nonempty, and let P = {ry: 2 € Sandy € T}. 
Prove that inf(P) = inf(S) - inf(T). 


23. Let f: D— Rand g: D > R be functions such that f(x) < g(y) for all x,y € D. 
Prove that sup(f[D]) < inf(g[D)]). 


Exercise Notes: For Exercise 1, |2x + 1| < 5 is equivalent to —5 < 24+1 < 5. 
For Exercises 7 and 8, see Remark 2.3.19. For Exercise 9, read the proof of Theo- 
rem 2.3.10. For Exercise 11, apply Theorem 2.3.13. For Exercises 13 and 14 review the 
analysis and proofs for Lemmas 2.3.8 and 2.3.12. For Exercise 18 review the analysis 
and proof of Theorem 2.3.22. For Exercise 21, review strategies 2.3.23 and 2.3.24. For 
Exercise 22, first show that a = inf(S) > 0 and 6 = inf(T) > 0; and review strate- 
gies 2.3.9 and 2.3.11. Let c be a lower bound for P. If c < 0, then clearly c < af. 
Suppose c > 0 and let y € T. Prove that 7 is a lower bound for S. 


2.4 THE ARCHIMEDEAN PROPERTY 


In this section, we will explore some consequences of the completeness axiom. The 
first such consequence is called the Archimedean property, which asserts that every 
real number is strictly less than some natural number. Of course, this seems to be 
quite obvious, but this property actually depends on the completeness axiom. 


Theorem 2.4.1 (Archimedean Property of R). For each x € R, there is ann € N 
such that # <n. 


Proof. Let « € R. We prove that there exists a natural number n € N such that « < n. 
Suppose, for a contradiction, that n < x for alln € N. Thus, N C R is bounded above. 
By the completeness axiom, ( = sup(N) exists. Since 6 — 1 < 6, there is anim € N 
such that G — 1 < m (see Remark 2.3.19(1-b)). Therefore, 6 <m+1l=ne€EN; 
contradicting the fact that $ is an upper bound for N. This completes the proof. 


In our next theorem, we show that the Archimedean property implies two useful 
results. The first result states that for any two positive real numbers there is an 
integer multiple of one that exceeds the other. The second result tells us that for any 
positive real number wz, there is a natural number whose reciprocal is less than x. 
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Theorem 2.4.2. Each of the following statements hold: 


(a) For alle € Rand y ER, if x > 0, then there is an n € N such that y < na. 
(b) For all z ER, if x > 0, then there is an n € N such that 0 < 1/n <a. 


Proof. We first prove (a). Let x,y € R where x > 0. Consider the real number y/z. 
By Theorem 2.4.1, there is an n € N such that y/x < n. Thus, y < nz. To prove (b), 
let x > 0. From (a) (where we take y = 1), we conclude that there is an n € N such 
that 0< 1< nz. Thus, 0<1/n<z. 


The Archimedean property also implies the following two lemmas, the latter of 
which will be used in the next section. These two lemmas should be obvious to anyone 
who is familiar with the elementary properties of the real numbers; nevertheless, we 
will formally prove these two results. 


Lemma 2.4.3. For all x € R, there exists an integer m such that m—1< a2 < m. 


Proof. Let « € R. If x € Z, then m=2+1€Zandm—1<2<™m. Let x ¢ Z. So 
x #0. First suppose that + > 0. If x < 1, then m = 1 is as required. If x > 1, then 
(by Theorem 2.4.1 and the Well-Ordering Principle 1.4.1) let m be the least natural 
number such that « < m. Since the natural number m — 1 is less than m, it follows 
that m—1< <a. Therefore,m—1<a2<m. 

Now suppose that « < 0, then —x > 0 and the argument in the previous paragraph 
implies that there is an n € N such that n -—1 < —ax < n. Thus, by applying 
properties of inequalities, we conclude that —n < x < 1—n. Since x ¢ Z, we see that 
—n<a<l-n. Let m=1-n. Thus, m € Z,m-1=—-nandm—-1<a<m. 


Lemma 2.4.4. For all a,b € R, if 1 < 6—a, then there exists an m € Z such that 
a<m<b. 


Proof. Let a,b € R. Assume that 1 < b—a. Since 1 < b—a, we have that (i) 1+a < b. 
By Lemma 2.4.3, there is an m € Z such that (ii) m— 1 < a and (iii) a < m. Note 
that (i) and (ii) imply that m < b. Thus, (iii) implies that a << m < b. 


2.4.1. The Density of the Rational Numbers 


The completeness axiom, by means of the Archimedean property, implies that the 
set of rational numbers is dense in R; that is, between any two real numbers there 
exists a rational number. 


Definition 2.4.5. Let D C R. We say that D is dense in R, if for all z,y € R, if 
x <y, then there exists a d € D such that x <d< y. 


Theorem 2.4.6. The set Q of rational numbers is dense in R. 


Proof. Let x,y € R be such that x < y. Therefore, (y— x) > 0. Theorem 2.4.2(a) 
implies that there is ann € N such that 1 < n(y—z). Thus, 1 < ny—na. Lemma 2.4.4 
states that there is an m € Z such that nt < m < ny. Thus, x < ™ < y and 
q= % € Qis as required. 
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Theorem 2.4.6 implies that every open interval contains infinitely many rational 
numbers. So no matter where one may be on the real number line, there are rational 
numbers nearby. Thus, one may be tempted to conclude that there are just as many 
rational numbers as there are real numbers. However, in the next section, we will show 
that the set R of real numbers is much larger than the set of rational numbers Q. 


Remark 2.4.7. Let S = {1 € Q: a2 < V2} and T = {1 €Q: 24> Val. 
Theorem 2.4.6 implies that sup($) = V2 and inf(T) = V2. Since V2 ¢ Q, it follows 
that there is a “gap” in the set of rational numbers (see page 38). In fact, if we 
view the set of rational numbers as a number line, then this line is quite porous, 
as it contains many gaps-one for each irrational number. On the other hand, the 
completeness axiom ensures that the set of real numbers has no gaps, of any kind. 


Using Theorem 2.4.6, we will next show that the set of irrational numbers is also 
dense in R. First we present a simple property about the product of two real numbers, 
one of which is rational and the other is irrational (see Exercise 3). 


Lemma 2.4.8. Let x € Q be nonzero and y € R be irrational. Then xy is irrational. 


Theorem 2.4.9 (Density of the Irrationals in R). For all x,y € R, if x < y, then 
there exists an irrational w such that « <w < y. 


Proof. Let x,y € R be such that x < y. Therefore, FB < Wor Theorem 2.4.6 implies 


there is a nonzero gq € Q such that 0) <q< Roe Hence, « < qV2 < y. Since V2 is 


irrational, Lemma 2.4.8 implies that qv/2 is irrational. 


Thus, the set of rational numbers Q and the set of irrational numbers R \ Q are 
both dense in R. Therefore, between any two real numbers there are infinitely many 
rational numbers and infinitely many irrational numbers. 


Exercises 2.4 


1. Let a >0 and 6 > 0. Prove that there exists an n € N such that B <a. 

2. Let a < b be real numbers. Prove that there exists an n € N such that a+ 4 < 0b. 
*3. Prove Lemma 2.4.8. 

A, Let A={x EQ: 2 < 2}. Prove that sup(A) = 2. 

5. Let A= {x € Q: 2? < 2}. Prove that sup(A) = v2. 


6. Let g > 0 be a rational number. Prove that for all z,y € R, if x < y, then there 
exists an irrational w such that x < qw < y. 


7. Let «x < y<a<_b be real numbers. Prove that there exists rational numbers q 
and r such that x <q<yanda<qt+r<b. 


8. Let a € R. Suppose that a < 4 for all n € N. Prove that a < 0. 
9. Let a,b € R. Suppose that a < 6+ 4 for all n € N. Prove that a < b. 
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10. Let x > 0 bea real number. Prove that there is an n € N such that 4 <o<n. 


11. Let a and b be real numbers such that a < b. Show that there are infinitely many 
rational numbers between a and 6. 


Exercise Notes: For Exercises 4 and 5, apply proof strategy 2.3.20. 


2.5 NESTED INTERVALS THEOREM 


We conclude this chapter with two additional applications of the completeness axiom, 
namely, the nested intervals theorem and a theorem of Georg Cantor that shows that 
the set of real numbers is uncountable. These two interesting results are sometimes 
applied in real analysis. 


Definition 2.5.1. Let {J, : n © N} be an indexed set of closed intervals of R such 
that I, 2 In4i for all n € N; that is, 


I, DIp D113 DIg DI5D--- 


Then {J,, : n € N} is called a family (or set) of nested closed intervals. 


Of course, the notation A > B means B C A. The completeness axiom will be 
used in the proof of our next theorem, which states that for any nested family of 
closed intervals, there is a point that belongs to all of the intervals in the family. 


Theorem 2.5.2 (Nested Intervals Theorem). Let {J,,:n € N} be a family of nested 


closed intervals of R. Then () I, 4 2, that is, there exists an x € R such that x € I, 
neN 
for alln EN. 


Proof. Let {I, : 2 € N} be a family of nested closed intervals. For each n € N, let 
In = [An, bn] where ap, < by are real numbers. Thus, for all 7,7 € N, 


it 4, then a; Ss a; < be < 0, 


as illustrated in Figure 2.6. Let A = {a1, a2,a3,...}. The set A is bounded above. In 


Is hb th 
ts J 


a a2 a3 bz baby 


Figure 2.6: Family of nested closed intervals. 


fact, each b, is an upper bound for A (see Exercise 1). Let 6 = sup(A). Since f is an 
upper bound for A, we have that a, < 6 for all n € N. Moreover, as each b, is an 
upper bound for A and £ is the smallest such upper bound, we conclude that 8 < by, 
for all n € N. Hence, a, < 8 < b, for all n € N; that is, 6B € [an,b,] for all n € N. 
Therefore, () I, # ©. 


neN 
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Definition 2.5.1 concerns a nested family of closed intervals. This definition can 
clearly be generalized to arbitrary sets of real numbers. 


Definition 2.5.3. Let {A, : n € N} be an indexed set of subsets of R such that 
An 2 An+1 for all n € N; that is, 


A; D Ap D A3 D AyD Az D::: 


Then {A,,:n € N} is a family (or set) of nested sets. If A, D An4+1 for alln EN, 
the family is said to be proper. 


The proof of the above Theorem 2.5.2 clearly applies the completeness axiom. 
Recall that the set of rational numbers Q does not satisfy the completeness axiom 
(see page 38). Let us define the notion of a closed interval in the number system Q. 
Given a € Qand bE Q, wherea <b, lett T= {xe Q:a<a< }d}. Let us call Z 
a closed Q-interval. Using Theorem 2.4.6 and Remark 2.4.7, one can define a family 


{Z, : n € N} of nested closed Q-intervals such that (| Z, = 2. It follows that any 
nen 
valid proof of Theorem 2.5.2 requires the completeness axiom. 


2.5.1. IR is Uncountable 


We end this chapter with a curious application of Theorem 2.5.2 due to Georg Cantor, 
a brilliant mathematician who single-handedly created the branch of mathematics 
now called set theory. Cantor first observed that two sets A and B have the same size 
if and only if there is a one-to-one correspondence between A and B, that is, there is 
a way of evenly matching the elements in A with the elements in B. In other words, 
Cantor noted that A and B have the same size if and only if there exists a bijection 
f: A— B. In this case, Cantor said that A and B have the same cardinality. 
During his set theoretic investigations, Cantor introduced the following definition. 


Definition 2.5.4. A set A is said to be countable if A = @ or there exists a 
surjection f: N > A. If A is not countable, then A is said to be uncountable. 


Let A be a nonempty countable set and let f: N > A be a surjection. For each 
n€N, let a, = f(n). We can now itemize the elements of A in the following list: 


1, 42,43,44,.--- 


As f: N= A is onto A, it follows that every element in A appears in the above list. 
The proofs of the next two results are outlined in Exercises 11 and 12, respectively. 


Lemma 2.5.5. If f: N > A is a surjection, then there is an injection g: A > N. 
Theorem 2.5.6. If A is a countable infinite set, then there is a bijection f: N > A. 


Theorem 2.5.6 implies that a countable infinite set has just as many elements as 
the set N of natural numbers. It thus follows that a set is countable if and only if it 
either finite or has the same cardinality as the set of natural numbers. 

After introducing the notion of a countable set, Cantor asked a simple question: 
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Is every infinite set countable? In the pursuit of this question, Cantor proved that the 
set of rational numbers is countable (see Exercise 10). Cantor then conjectured that 
the set of real numbers is also countable. It came a complete surprise to Cantor when 
he discovered the next remarkable theorem. First, we establish a simple lemma. 


Lemma 2.5.7. Let [a,b] be a closed interval and let x € R. Then there is a closed 
subinterval [a’, b'] C [a, 6] such that x ¢ |[a’, b’]. 


Proof. Let [a, 6] be a closed interval and let x € R. If x ¢ [a,b], then let [a’, b'] = |a, b]. 
If x € [a, bj, let c € (a,b) be such that c 4 x. If c < a, then let [a’, b'] = [a,c]. Ifa <c, 
then let [a’, b’] = [c, b]. In each case, there is an [a’, b'] C [a,b] where x ¢ [a’, b’). 


Theorem 2.5.8 (Cantor). The set of real numbers R is uncountable. 


Proof. Suppose, for a contradiction, that R is countable and let f: N ~ R bea 
surjection. For each n € N, let x, = f(n). Thus, as f is onto R, every real number 
appears in the list 

L1,2,13,04,... (2.5) 


Let [a1,b;] be a closed interval such that x1 ¢ [ai,b;]. By Lemma 2.5.7, there is a 
closed interval [a2, b2] C [a1, bi] such that x2 ¢ [a2, bg]. Continuing in this manner, we 
obtain a nested family {[an, b»] : 2 € N} of closed intervals such that (W) tn € [an, bn] 
for every n € N. Theorem 2.5.2 implies that there is a real number x such that 
(A) & € [dn, by] for all n € N. Thus, by (¥) and (A), x does not appear in the list (2.5). 


Hence, f is not onto R. This contradiction implies that R must be uncountable. 


We now know that Q is countable and R is uncountable. Both of these sets are 
infinite, but Theorem 2.5.8 implies that R is “more infinite” than the set of rational 
numbers Q. After proving that the set of real numbers is uncountable, Cantor was 
able to prove that there is an increasing sequence of larger and larger infinite sets. 
In other words, Cantor showed that there are “infinitely many different infinities,” a 
result with clear philosophical and mathematical significance. 


Exercises 2.5 


*1. Let {Z, : n € N} be a family of nested closed intervals. For each n € N, let 
In = [an, bn] where a, < by, are real numbers. Thus, for all 7,7 € N, 


ifi<y, then ay S a; <b; < b;. 
Let n € N. Show that az < 6, for all k EN. 


2. Let {J, :n € N} be a nested family of closed intervals of R, where I, = [an, bn]. 
Let A = {aj,a2,a3,...}. In the proof of Theorem 2.5.2 it is shown that the 
supremum 3 = sup(A) exists. Now, let B = {b1, ba, b3,... }. 

(a) Show that 7 = inf(B) exists. 

(b) Show that 8 < ¥. 

(c) Show that if 8 <7, then (1) [, = [8,9]; and if G@=~7, then () In = {6}. 
neN 


neN 
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. For each n €N, let O, be the open interval O, = (dn, by) where an < bp are real 


numbers. Suppose that a, < dni1 and by41 < by for all n € N, that is, 
ay < ag < AR <0 SM On Set tt SOA << <3 < bg < By. 


Prove that () On # 2. 


neN 


. Find a nested proper family of open intervals whose intersection is empty. 


5. Find a nested proper family of closed intervals whose intersection has exactly 


*10. 


*11. 


*12. 


one point. 


. Find a nested proper family of closed intervals whose intersection is the closed 


interval [1, 2]. 


. Let {I, : n € N} be the nested family of intervals of R defined by I, = [n, co) 


for alln € N. Show that () J, = 2. 
nen 


. Let f: N > Q be a surjection and let {J, :n € N} be a family of nested closed 


intervals such that f(n) € I, for all n € N. Let 6 < y be as in Exercise 2. Show 
that 6 = 7 and that { is an irrational number. 


. Let f: N > Q be a surjection. By modifying the proof of Theorem 2.5.8, show 


that there is a family {Z,, : n € N} of nested closed Q-intervals (page 51) such 
that f(n) ¢ Z, for all n € N. Conclude that 1 Z, =. 


neN 


Consider the function f: N > Q defined by 
—1)°2, if n = 2°3°5¢ and a,b,c EN; 
ln) = 4? 
0, otherwise. 


Show that f is onto Q. So Q is countable. (By Theorem 1.4.3, f is well-defined.) 
Let f: N- A be a surjection. Define g: A > N by 


g(a) = n if and only if n is the least natural number such that f(n) =a, 


for each a € A. Prove that g is one-to-one. 


Let A be an infinite countable set. By Definition 2.5.4 and Lemma 2.5.5, there 
is an injection g: A > N. Since A is infinite, the range of g must be an infinite 
subset of N. Let R = ran(g). Since R is an infinite set of natural numbers, we 
can list the elements of R in strictly increasing order, say R = {n1,n2,n3,...} 
where n; < ni41 for all i € N.' Define f: N— A by f(z) = g7!(nj), for all ic N. 
Prove that f: N > A is a bijection. 


Exercise Notes: For Exercise 1, there are two cases: either n < k or k < n. Exercise 4 
shows that the conclusion of Theorem 2.5.2 does not necessarily hold if the nested 


intervals are not closed. 


'The sequence ni, n2,73,... can be defined by induction using the Well-Ordering Principle 1.4.1. 


Taylor & Francis 
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CHAPTER 3 


Sequences 


A sequence is an enumerated infinite list of real numbers. In this chapter, we will 
investigate sequences and prove a number of important theorems about the limit 
of a sequence. The proofs will often require the completeness axiom. Sequences are 
fundamental in real analysis and, while you may already be familiar with sequences, 
it is useful to have a formal definition. We shall define a sequence to be a function 
from the set of natural numbers into the set of real numbers R. 


Definition 3.0.1. A sequence is a function s: N + R. We shall denote the value 
s(n) by sp, where s,, is called the nth term of the sequence. We will write s as (s,), 
as (81, S2,53,-..) or as (S,)°°, when we want to emphasize that the index variable n 
begins with 1. Moreover, (s,,) is said to be a sequence of distinct points if 5, 4 sm, 
whenever n 4 m. 


eal a rahi siecle ait ates laa 
1234567 8 9101) +e 
Py 


Figure 3.1: Functional representation of a sequence: s(4) = s4 > 0 and s(6) = s¢ < 0. 


Consider the sequence (s,) where s, = +. Then we can write (s,) as (+) or as 
el, s, 3 ‘, ...). A constant sequence is denoted by (a) or (a,a,a,a,...), where aisa 
fixed real number. 

One can also have sequences of the form (s,)°°, = (Sk, Sk41,---) Where k > 1 or 
k; = 0. However, one can easily re-express such a sequence as one starting at 1. Define 
(tn) 1 by th = Sn+k—1 for all n > 1. Then (¢,)7°., = (Sx, Se41,---). For example, 


(x44) o = (4)cc. 
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3.1 CONVERGENCE 


The limit concept is one of the oldest and among the most important concepts in 
mathematical analysis. The formal definition of the limit of a sequence has a long 
and deep history. Around 250 BC, Archimedes informally used the convergence of 
sequences, via the method of exhaustion, to compute approximations of 7 and to 
show that zr? equals the area enclosed by a circle of radius r. The mathematically 
precise notion of the limit of a sequence that we use today was not formulated until 
the 19th century by Cauchy and Weierstrass. 

The limit of a sequence (s,,) identifies the long-term behavior of the sequence; 
that is, what happens to s, for larger and larger values of n. If the terms s, get 
closer and closer to a real number @, then we say that (sp) converges to £ and that 
€ is the limit of the sequence. However, the notion of a sequence getting closer and 
closer to a real number seems to be quite vague and very likely cannot be be used 
to rigorously prove theorems about convergence. It took over a hundred years after 
Newton and Leibniz before the following question was correctly addressed: 


Can the notion of sequential convergence be given a precise mathematical 
definition? 


In 1821, Cauchy responded to this question with the following definition: 


When the values successively approach ... a fixed value, eventually differ- 
ing from it by as little as one could wish, that fixed value is called the limit 
of all the others. 


Cauchy’s definition of convergence is correct but it lacks sufficient precision. This 
shortcoming was addressed around 1870, when Karl Weierstrass proposed our next 
definition, which is now the standard definition that is used in real analysis and in 
calculus. 


Definition 3.1.1. A sequence (s,) is said to converge to the real number ¢ provided 
that for all ¢ > 0, there exists a natural number N such that for alln € N, ifn > N, 
then |s, — @| <. 


e N 


Figure 3.2: For all n > N, we have |s, — €| <«. 


Figure 3.2 illustrates Definition 3.1.1. If a sequence (s,) converges to @, then ¢ 
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is called the limit of the sequence (s,) and we write lim Sn = &. If a sequence 
(Sn) does not converge, then we shall say that (s,) diverges. The logical form of 
Definition 3.1.1 can be expressed as 


(Ve > O)(AN EN)(Vn EN)\(n > N > [5p —4] <6) (3.1) 


and it is this logical form that motivates the following proof strategy. 


Proof Strategy 3.1.2. To prove that lim Sn = £, use the proof diagram: 


Let ¢ > 0 be an arbitrary real number. 
Let N = (the natural number you found). 
Let n > N be an arbitrary natural number. 
Prove |s, — €| <. 


To apply proof strategy 3.1.2 on a sequence defined by an explicit formula, we 
first let ¢ > 0. Then we find a natural number N such that when n > N, we can 
prove that |s,, — ¢| < ¢. To find the desired N, we undertake the following: 


Using algebra and properties of inequality on the expression |s,, — ¢|, we 
“extract out” a larger value that resembles 1 


We shall then use this larger value to find N so that when n > N, we will have 
that |s, — ¢| < ¢. We will illustrate this idea in our proof analysis of the next four 
theorems. Before we discuss these theorems, we identify three properties of inequality 
that are very useful when proving theorems about convergence. 


Quotient Principles of Inequality 3.1.3. Let a,b,c,d be positive real numbers. 


(1) Given the ratio §, if @ < ¢, then you can conclude that ¢ < §. 
(Replacing a numerator with a larger value yields a larger ratio.) 


(2) Given the ratio $, if d < b, then you can conclude that # < §. 


(Replacing a denominator with a smaller value yields a larger ratio.) 

(3) Given the ratio ¢, if a < cand d < 6, then you can infer that ¢ < $. 
(Replacing a numerator with a larger value and a denominator with a smaller value 
yields a larger ratio.) 


Example. The above property 3.1.3(2) implies the following assertions: 


1. t< i whenn>WN>0. 


2. a < Tr when n > N > 0, by Theorem 1.1.11 on page 6. 
3. oe < = when 2” >n>0. 


Theorem 3.1.4. lim 1 = 0. 


Proof Analysis. We apply proof strategy 3.1.2 to the sequence (4). Let ¢ > 0. We 
need an N € N such that ifn > N, then |+ —0| < ¢. Since |+ —0| = - we thus 
need an N € N so that 1 < «when n > N. Solving the inequality 1 < € for n, we 
conclude that n > t. So if we take N > i, then we can prove the desired result (NV 
exists by the Archimedean property, as stated in Theorem 2.4.1). We now present a 


logically correct proof that follows the structure outlined in proof strategy 3.1.2. 
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Proof. Let ¢ > 0. Let N > 4 be a natural number. Let n € N be such that n > N. 
We prove |+ — 0| < € as follows: 


1 1 
——0]=|-]_ by algebra 
n n 
1 1 
= because — > 0 
n n 
oe : b >wN 
N ecause n 
1 1 
ea because N > — 
= € 
(4 
=€E by algebra. 


Therefore, |+ — 0| <E. 
. 1 as 
Theorem 3.1.5. Jim, a 0. 


Proof Analysis. Let ¢ > 0. We need an N € N such that ifn > N, then Fa — 0| <€. 
Since as — 0) a Tr we thus need an N € N so that Fr <e¢when n > N. Solving 


the inequality a7 < ¢ for n, we obtain n > =. So if we take N > =, then we can 


prove the theorem. We now apply proof strategy 3.1.2 and present the proof. 


Proof. Let ¢ > 0. Let N > = be a natural number. Let n € N be such that n > N. 


Thus, Fa < Tr We prove that lS — 0| < as follows: 


1 1 
Ra 0 = Ba by algebra 
1 1 
—— Ti because ai >0 
1 
< because n > N 


1 1 
< —— _ because N > x 
1 E 
2 


=e by algebra. 


Therefore, 


1 
Va — 0| SE. 
In the future, we will use N to denote a natural number. 
Theorem. lim 1+ oa =1. 
noo 


Proof Analysis. Let ¢ > 0. We need an N € N so that ifn > N, then [1 + oa _ 1| <€. 
Since [1+ x — 1| = oan we must find an N € N such that if n > N, then x <€. 
Solving the inequality aia < ¢ for n is difficult. So we take a different approach. By 
Exercise 2 on page 26, n < 2” and so ao < 1. when n > 1. By solving the inequality 
4 < e for n, we obtain n > 1. So if we take N > z, then we can prove the theorem. 


We now present a proof which is guided by proof strategy 3.1.2. 
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Proof. Let ¢ > 0. Let N > 4 be a natural number. Let n € N be such that n > N. 
We prove that |(1+ 3) — 1] < as follows: 


1 1 
(+5) -1| = lo by algebra 

o 1 1 

=o because a >0 
1 

<- because n < 2” 
n 

< : b >N 
N ecause n 
1 1 

Sa because N > — 
= € 
€. 

=€ by algebra. 


Therefore, [1 + oa = 1 <E. 


In the proof analysis for each of the above three theorems, we were able to find 
the required N without much difficulty. Specific sequences with more complicated 
definitions may require more work to find N. This work can be reduced by applying 
the Quotient Principles of Inequality 3.1.3 and the Sum and Product Principles of 
Inequality 1.1.8. In particular, when dealing with a ratio of polynomials in n, these 
two principles justify the techniques presented in the following remark. 


Remark 3.1.6. Given a ratio of polynomials, we illustrate a method for finding a 
larger and simpler ratio. The first item below discusses how to find a larger numerator, 
while the second item considers how to find an eventually smaller denominator. 


1. (Larger Numerator) Given a polynomial in n, say cn* + 6n? — 10, with highest 
power n* where c > 0, by dropping the negative terms, we obtain a larger value. 
For example, cn* + 6n® — 10 < cn* + 6n3. Now since n? < n*, we see that 
cn® + 6n? < en* + 6n*. Thus, en® + 6n? — 10 < (c+ 6)n* for all n EN. 

2. (Smaller Denominator) Given a polynomial in n, say cn* +6n? — 10, with highest 
power n* where c > 0, by dropping the positive terms with lower powers, we get 
a smaller value. For example, cn* — 10 < en* + 6n3 — 10. As cn* — 10 contains a 
negative term, let 0 < s < cand find an m € N such that such that sn* < en*—10 
when n > m. Hence, sn® < cn® + 6n3 — 10 for all n > m. 


We will apply Remark 3.1.6 in the proof analysis of the following theorem. 


‘ 2 tan 
Theorem. lim 222-3 = 1. 
n— 00 N—N—5 


Proof Analysis. Let ¢ > 0. We must find a natural number N such that ifn > N, 
then nten-8 — | <e. By algebra, we obtain 


3n + 2 


Pl 3 | 
n2—n—5 


n2—n—5 
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We see that 3n + 2 > 0 and n° — n? —5 > 0 when n > 3. Thus, if n > 3, we have 


3n + 2 


| - 3n + 2 
n2—n—5| nn 


2 


—n—5 


We now need to find an N > 3 such that ifn > N, then a < ¢. Solving the 


5 
inequality te < e¢ for n is difficult. So we shall take advantage of Remark 3.1.6. 


First, we will get a real number 6b > 0 such that 


3n +2 < bn, for alln EN. (3.2) 
Then we will get a real number s > 0 such that 
sn? <n? —n—5, for all “large” n EN. (3.3) 


To find the 6 in (3.2), notice that 3n + 2 < 3n+2n = 5n for all n € N. So, we 
shall let b = 5. We now need to identify an s such that (3.3) holds. Let s = 1/2 (see 
Remark 3.1.6(2)). We must find m so that ifn > m, then (x) $n? < n?—n—5. 
This latter inequality is equivalent to n? < 2n? — 2n — 10, which is equivalent to 
10 < n? — 2n = n(n — 2). Thus, when n > 5 we have that (x) holds. Therefore, for 
all n > 5, we have that 


n2—n—5~ 1n? n- 


a 
n4—n—5d5 5 


3n + 2 | - 3n + 2 ce Dn 10 


Solving the inequality tu < ¢ for n, we see that we must let N > max{ 7, By. 


Proof. Let ¢ > 0. Let N > max{*2, 5} be in N. Let n € N be such that n > N. Since 
n > 3, we have that 


n? +2n—3 |- 3n +2 


n2—n—5 n2—n—5 


because 3n+2 > 0 and n?—n?—5 > 0. In addition, as n > 5, we have that 3n+2 < 5n 
and $n? <n? —n-—5. Therefore, since n > N > max{*2, 5}, we conclude that 


n? +2n—3 3n+2_ — 5n 10 _ 10 _ 10 
= — = €. 
n?—n—5 n?—n—5 ~ in? n N w 


This completes the proof. 


Suppose in a proof that you are assuming that a given sequence converges. Our 
next strategy will be useful when dealing with such an assumption. 


Assumption Strategy 3.1.7. If you are assuming that lim Sy = &, then for any 
€ > 0, there is an N € N such that |s, — ¢| < ¢ foralln > N. 


So, in a proof, when you are assuming that Jim, Sn = €, you can conclude that 
for any positive value v > 0, there is an N such that |s, — ¢| < v for alln > N. We 
shall express this observation as “we can make |s,, — | as small as we want.” We now 
apply this idea to prove the following theorem. 
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Theorem 3.1.8. If lim s, = @,¢>0 ands, > 0 for alln > 1, then lim \/s, = Ve. 
noo noo 


Proof Analysis. Suppose lim Sn = €. We need to prove that lim /sn = V0, where 
€>0 and s, > 0 for all n > 1. Our proof will have the following logical structure: 
Assume lim sy. 
Let ¢ > 0 be an arbitrary real number. 


Let N = (the natural number you found). 
Let n > N be an arbitrary natural number. 


Prove | Vin — vq <eé. 


We must find a natural number N such that if n > N, then | Vn — vi < ¢. Here is 
the basic plan that we will apply to get N. 


Using algebra and properties of inequality on the expression F [Sn — vil, 
we “extract out” a larger value that contains |s,, — ¢| and no other occur- 
rences of 8p. 


Since lim Sp, = €, we can make |s, — ¢| “as small as we want.” We should then be 
noo 

able to make | Vin = vi < e and find N. Let us now execute this plan! First we 

start with | V3n - ve and extract out |s,, — ¢| as follows: 


rationalizing the numerator 


| (V5n — V2) (an + V2) 
vv = 1 ee 


S, —2£ 
= | —_} by algebra 
Wa +v0 
[sn — ¢| 
Se because ,/s, + VZ>0 
Jan t vb 
n—£ 
< Is | because V0 < V/s, + Ve. 


eae 


= [sn=é |sn—€| ; 
\/Sn Ve | < i and we extracted out the larger value Vt that contains 
|sn—e| 


VE ~ & then we will have that 
|V3n — vel < ¢. How small must |s, — ¢| be in order to ensure that nl <¢? By 


Hence, 


|s, — | and no other occurrences of s,. So if 


solving this latter inequality for |s, — ¢|, we obtain |s, — ¢| < eV@. Hence, we need 
an N so that |s, — ¢| < eV when n > N. Since lim Sn = &, there is such an N. 
noo 


This is the value for N that we will use in the proof. 
Proof of Theorem 3.1.8. Assume that lim Sn = € where @ > 0 and s, > 0 for all 
n>1.Lete >0. As lim Sn = ¢, there is an N € N such that (A) |s, — €| < eV for 
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alln > N. Now let n > N. Thus, 


pee 
1 (fan + Ve) 


van — va = 


rationalizing the numerator 


by algebra 


| 
w 
3 
| 
oS 


[Sn —¢| 
= —___ because 4/8, + Ve>0 
Jin + ve we 
< ae i] because V0 < Jsn t+ ve 
< ae =€ by (a). 


Therefore, 


Sa = Ve | < «and this completes the proof. 


Theorem 3.1.8 also holds if € = 0 (see Exercise 16). It may seem obvious that 
a convergent sequence cannot have more than one limit; but, this requires a proof, 
which is given below. In this proof, we will use two tools that are commonly used in 
real analysis; namely, the triangle inequality and the algebraic “trick” of adding and 
subtracting the same value. 


Theorem 3.1.9 (Uniqueness of the Limit). If the sequence (s,,) converges, then the 
sequence has only one limit. 


Proof. Assume that (s,) converges. Suppose, for a contradiction, that ¢ # @ are 
both limits of the sequence (s,). Thus, ¢ = |€— | > 0. Since (s,) converges to ¢ 
there is an N € N such that for all n > N, |s, — €| < 5. Also, since (sp) converges 
to @ there is an N’ € N such that for all n > N’, |s, — ¢’| < §. Therefore, for all 
n > max{N, N’}, we have that 


& 


5 © 


Je | = |(2- 8) + (Ge — &)] S [= snl + loo — 1 < + 


Hence, |¢ — @’| < e. But this contradicts the fact that « = |¢— ¢’|. 


We next present an equivalence that is sometimes useful. 
Theorem 3.1.10. Let (s;,) be a sequence and @ € R. Then lim S, = ¢ if and only 
if lim (Ss, — 2) = 0. 


Proof. Assume that lim s, = ¢. Let ¢ > 0. Since lim s, = @, there is an N EN 
N—- Oo N—- Oo 


where |s, — ¢| < ¢ for alln > N. Let n > N. Thus, |(s, — €) — 0| = |s, — €| < e. 
Hence, lim (Ss, — €) = 0. The converse follows similarly. 


When assuming that a given sequence converges and you want to prove that 
another sequence converges, then assumption strategy 3.1.7 can be useful. Many 
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times we will be assuming that lim Syn = € and will be working with ¢ > 0. Using 
assumption strategy 3.1.7, we can conclude that for any positive e’ < € (e.g., e’ = §), 
there is an N’ such that for all n > N’, we we have |s, — €| < ¢’. 
Theorem 3.1.11. Let (s5,,), (a,) be sequences and let ¢ € R. If 


(1) |sn — | < kla,| for alln > m, where k > 0 andmeN, 

(2) lim an =0, 
then lim s, = @. 

N—- Oo 

Proof Analysis. Assume (1) and (2). Thus, from (2), we can make |a,| = |a, — 0| 
“as small as we want.” Let ¢ > 0. We need an N € N such that if n > N, then 
|S, — €| < €. As |s, — | < k|a,| when n > m, we need an N > m such that ifn > N, 
then k|an| < €. Solving this latter inequality for |a,|, we obtain |a,| < §. Since 
Jim Gn = 0, there is an N’ such that whenever n > N’ we have |a, — 0| = |an| < =. 
So we will use N = max{N’,m}. So ifn > N, then n > N’ and n> m. 


Proof. Assuming (1) and (2) we shall prove that lim Sn = €. To do this, let ¢ > 0. 
By (2), lim a, = 0. Thus, there is an N’ €N such that («) |an —0| = |an| < § 
for all n > N’. By (1), we have that («) |s,—¢| < kla,| for all n > m. Let 


N = max{m, N’}. Let n > N. Thus, 
|S, —£|<klan| by (x) because n > N >m 
<k (5) by (x) because n > N > N’ 
=€ by algebra. 


Therefore, |s,, — ¢| < ¢. This completes the proof of the theorem. 


For a simple application of Theorem 3.1.11, let 2 € N and consider the sequence 


(4) where k # 0. Since | 4 — 0) < |k| + for all n € N, Theorems 3.1.4 and 3.1.11 


imply that lim « = 0. Theorem 3.1.11 can also be used to reduce the amount of 


work used to prove certain limit equations. 


Example. We will use Theorem 3.1.11 to prove that lim Bliss 


noo 3n?+n—10 


= 3. To apply 
this theorem, we must find an upper bound for 


On? —n+1 
3n? +n—10 


—4n + 31 
3n? +n — 10 

for sufficiently large n. Clearly, |—4n + 31| < 4n+31 < 4n+31n = 35n for all n. For 
the denominator, we will determine when n? < 3n? — 10 (see Remark 3.1.6(2)). This 


latter inequality is equivalent to 10 < 2n? which holds when n > 3. Thus, if n > 3, 
we have that 


1 
= 35-. 
n 


On? —n+1 _ 
3n? +n — 10 a 


—4n+31 | < 35n 


3n2 +n —10 n? 
9n2—n+1 _ 3 


Since jim 5; = 0, Theorem 3.1.11 implies that Jim, aannid = 
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Theorem 3.1.11 implies the following two useful corollaries. 


Corollary 3.1.12. Let x be a such that |x| < 1. Then lim a, 


Proof. If « = 0, then clearly lim «” = 0. Assume that 0 < |z| < 1. Since izT = i, 
there isa c > Osuch that 1+c= ae By Bernoulli’s inequality (Exercise 5 on page 27), 


(1+ c)” >14+4 nc, for every n € N. Hence, art = (l+c)” >1+nc > ne, for all 
neéEN. So 


1 
jz” — 0] =|[2|"< —, 
nc 


for alln > 1. As lim I = 0 and t > 0, Theorem 3.1.11 implies that lim xz” = 0. 


The next corollary can sometimes be used to prove that a particular sequence 
(un) converges to a specific real number r. 


Corollary 3.1.13. Suppose that lim Sn = &. Let r € R and (up) be such that 


|Un —r| <k|s, — €| for alln > m, for some k > 0 and m €N. Then lim i= 


Proof. By Theorem 3.1.10 we have that lim (Sy — 2) = 0. Since |un — r| < k|s, — | 
for all n > m, where k > 0 and m € N, Theorem 3.1.11 implies that lim Un =f. 


If (s,) converges to @, then for ¢ > 0 there is an N € N such that |s,, — ¢| < e for 
alln > N. So, for any K > N, we have |s, — ¢| < ¢ for alln > K as well. 


Remark 3.1.14. Let (s,,) be a sequence and let  € R. What does it mean to say 
that (s,) does not converge to ¢? By taking the negation of the logical form (3.1), 
we see that “the sequence (s,,) does not converge to 2” means the following: 


There is an € > 0 such that for all N EN, there is ann > N so that |s, — ¢| >. 


Neighborhoods 


Definition 3.1.15. Let « € Rand let ¢ > 0. The open interval (x—¢, +e), centered 
at x, is said to be a neighborhood of z, and is denoted by U?. 


Let x be a real number and let U? be the neighborhood (x — ¢,x+ €) of « where 
é > 0. Then for any real number s, we have that s € U? if and only if |s — 2| < e. We 
will write U” to denote a neighborhood of x when it is not important to specify e. 
The following theorem just states that the notion of convergence can be expressed in 
terms of neighborhoods (see Figure 3.3). 


Theorem 3.1.16. Let (s,,) be a sequence and £ be a real number. Then the following 
are equivalent: 


1. The sequence (s,,) converges to @. 
2. For alle > 0 there is an N € N such that for alln € N, ifn > N, then |s, — €| <e. 


3. For every neighborhood U* there is an N € N such that for all n EN, ifn > N, 
then s,, € U*. 
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$1 82 §3 S84 85 --- ee Sn 


Figure 3.3: For all n > N, we have |s, — €| <«. 


Corollary 3.1.17. Let (s,) be a sequence of distinct points and suppose that (s,,) 
converges to £. Then every neighborhood of @ contains an infinite number of points 
from the sequence (s,). 


Proof. Let U* be any neighborhood of ¢. Since the sequence (s,) converges to @, 
Theorem 3.1.16 states that there is an N € N such that for alln € N, ifn > N, then 
sy, € U*. Thus, an infinite number of points from the sequence (s,) are in U*. 


Given a sequence (s,,) that converges to ¢. Theorem 3.1.16 implies that for every 
neighborhood U* of ¢ there exists an N € N such that s, € U‘ for all n > N. We can 
say, in this case, that the terms of the sequence (s,) are eventually in U‘. 

Our next result implies that every real number is the limit of a sequence of rational 
numbers. 


Lemma 3.1.18. Let D C R be dense in R. Let x be any real number. Then there is 
a sequence (d,,) that converges to x where d, € D for all n > 1. 


Proof. Since D C R is dense in R, for each n € N, there is a d, € D such that d, is 
in the interval (x — 4,2 + +) and so, |x — dn| < 4. Since lim + = 0, Theorem 3.1.11 


implies that Jim, dn = x. 


Bounded Sequences 


We now consider some other properties that a sequence (s,) may possess. These 
properties are not really new, as they are very similar to the definitions on bounded 
sets discussed in Chapter 2. 


Definition 3.1.19. A sequence (s,,) is bounded above if there is a real number M 
so that s, < M for alln EN. 


Proposition 3.1.20. Let (s,) be a sequence that is not bounded above. For every 
real number B and natural number m, there exists an n > m such that s, > B. 


Proof. Suppose (s,) is not bounded above, and let B € R and m € N. Now let 
M = max{s1, 52,...,5m,B}. Since (s,) is not bounded above, there is an n € N 
such that s, > M. Because M > B we have that s, > B. Furthermore, because 
Sn > M > s; for all i < m, we must have that n > m. 


Definition 3.1.21. A sequence (s,,) is bounded below if there is a real number M 
so that M < s,, for alln EN. 


A proof similar to that of Proposition 3.1.20 will establish the following result. 
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Proposition 3.1.22. Let (s,) be a sequence that is not bounded below. For every 
real number B and natural number m, there exists an n > m such that s, < B. 


Definition 3.1.23. A sequence (s,,) is bounded if there is are real numbers a and 
6b such that a < s, < 6 for alln EN. 


Remark. A sequence (s,,) is bounded if and only if there is an M > 0 so that 
|sn| < M for all n € N (see Theorem 2.3.2). Thus, a sequence is unbounded if for 
every M > 0, there is an n € N such that |s,| > M 


Here are some examples that illustrate the above definitions. 


1. The sequence (n?) is bounded below but it is not bounded above. 


2. The sequence ((—1)") is bounded but it does not converge. 


3. The sequence (co" is bounded and it converges. 


Let (s,) be a sequence. Suppose there is a B > 0 and an N such that |s,| < B 
for all n > N. This leaves a finite number of terms of the sequence that may not be 
similarly bounded by B. But a finite set of terms is always bounded. Thus, one can 
conclude that the entire sequence is bounded; that is, that there is an M > 0 such 
that |s,| <M for all n > 1. The following example affirms this fact. 


Example. Let (s,) be the sequence where s, = 2+ + where n > 1. Let B=2+ 4. 
Notice that for all n > 10, we have s, < B. But all of the values s1,...,519 are 
greater than B. Let M = max{sj,...,519, B} = 3. We now have that s, < 3 for all 
n > 1. Thus, the sequence (s,,) is bounded. 


Theorem 3.1.24. If the sequence (s,,) converges, then (s,,) is bounded. 


Proof. Suppose that Jim, Sn = €. Corresponding to ¢ = 1 there is an N € N such 
that |s, — | <1 for alln > N. Thus, |s,| — || < |sp — | < 1 for alln > N. Hence, 
|Sn| < |€) +1 for alln > N. Let M = max{|s1|,...,|sv|,|€) +1}. Thus, |s,| < M 
for all n € N. Therefore, (s,,) is a bounded sequence. 


By expressing Theorem 3.1.24 in terms of its contrapositive, we conclude that if 
a sequence (s,,) is unbounded, then the sequence (s,,) diverges. 


Exercises 3.1 


1. Let aE R. Prove that the sequence (a + (—1)"22+1) is bounded. 
2. Let k £0. Use Definition 3.1.1 to prove that lim E = 0. 


3. Use Definition 3.1.1 to prove that Jim, faa =, 
4. Use Definition 3.1.1 to prove that Jim, a5 =3. 
5. Use Definition 3.1.1 to prove that Jim, gu =2 
6. Use Definition 3.1.1 to prove that Jim, Sut = 3. 


*16. 


. Use Theorem 3.1.11 to prove that lim 57 =0 and lim 
N—- Oo noo 
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2 
1 6n*+3n _ 3. 


2 2n?—5 


. Prove that the limits given in Exercises 3-6 hold by applying Theorems 3.1.11 


and 3.1.4. 


. Prove Theorem 3.1.8 using Corollary 3.1.13. 


. Let c € R be constant. Prove that lim c= c. 
N—- Oo 
. Suppose lim s, = @ and let c € R be a constant. Prove that lim (c+s,) =c+é. 
n—-0o N—+00 


. Let (s,) be a convergent sequence. Suppose lim Sn = € and let c € Rbea 
n [e,e) 


nonzero constant. Prove that lim (cs,,) = cé. 
N—- Oo 


. Suppose that lim s, = ¢. Prove that lim |s,| = |¢|. 
n—-0o noo 

. Suppose that lim |s,,| = 0. Prove that lim s, = 0. 
n—-0o noo 


. Suppose that lim Sp, = € and |s,| < M for all n > 1, where M > 0. Prove that 


Lita 62:30”, 
N—- Oo 


Suppose Jim, Sn = 0 where s, > 0 for all n > 1. Prove that Jim, \/Sn = 0. 


. Let (x) and (y,) be two convergent sequences. Prove that there exists an M > 0 


such that |z,| < M and |y,| < M for all n > 1. 


. Let (s,) be a convergent sequence. Suppose that lim Sn = €. Prove that there 
n?Co 


exists an M > 0 such that |s, + | < M for alln > 1. 


. Use Theorems 3.1.11 and 3.1.4 to prove that lim sin(n) _ Q), 
. Use Theorem 3.1.11 and Theorem 3.1.5 to prove that lim (/n+1—/n) =0. 
. Suppose that lim Sn = €. Prove that lim eee a 


. Prove Proposition 3.1.22. 
. Let (s,) be an unbounded sequence and let k 4 0. Prove that (s?) and (ks) are 


unbounded sequences. 


. Let x be a such that |z| > 1. 


(a) Prove that the sequence (xz”) is unbounded. 


(b) Let k £0. Conclude that (kx”) is unbounded. 


Exercise Notes: For Exercise 5, observe that n < 3n—2 when n > 1. For Exercise 6, 
observe that n < |2n—7| when n > 7. In this case, we would need N to be at 
least 7 and so, N > max{7, if} could be used in the proof. For Exercise 20, show 


that |Vn+ - vil = yrs oe a For Exercise 24(a), modify the proof of 
Corollary 3.1.12. 
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3.2 LIMIT THEOREMS FOR SEQUENCES 


In this section, we will derive some of the standard results that concern convergent 
sequences. In particular, we will show that the algebraic operations and the order 
relation < are compatible with the limit operation. 


3.2.1 Algebraic Limit Theorems 


An algebraic limit theorems states that if you know the limits of some given sequences, 
then you can determine the limit of a new sequence that is an algebraic combination 
of the given sequences. Such limit theorems typically have the following form: 


Theorem. If lim s, = sand lim t, =t,then lim u, = u where (u,) is constructed 
n-co noo no 


from the sequences (s,) and (t,), and wu is a function of s and t. 


Given a theorem of this form, how does one prove that lim uy, = u? 
N—- Oo 


Proof Strategy 3.2.1. To prove that lim Un = u, apply the following diagram: 


Assume lim s, = s. 
n-co 

Assume lim t,, = t. 
n-co 

Let ¢ > 0 be a real number. 

Let NV = (the natural number you found). 
Let n > N be a natural number. 

Prove |un — ul <e. 


So given ¢ > 0, one must find a natural number N such that if n > N, then 
|u, — u| < €. Here is the basic idea that one can apply to get N. 


Using algebra and properties of inequality, from the expression |uy — u| 
“extract out” a larger value containing |s, — s| and |t,, — t|, and no other 
occurrences of S,, or tn. 


Since Jim Sn = s and Jim tn, = t, we can make |s,, — s| and |t, — t| “as small as we 
want.” One should then be able to find the desired N. We will apply this technique 
in our proof analysis of the next two theorems. 

Given two convergent sequences (s,) and (tn), we will next prove that the sum 
sequence (8, + t,) converges. We first present an analysis that demonstrates how to 
employ proof strategy 3.2.1 and the above discussion. 


Theorem 3.2.2. If lim s, = s and lim t, =¢, then lim (s, +t,)=s+t. 
Noo Noo Noo 


Proof Analysis. Assume that lim s, = s and lim t, = t. So we can make |s,, — s| 
noo noo 


and |t, —t| as small as we want. Let ¢ > 0. We need to ensure that 


I(Sn +tn) —(s+t#)| <e 
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for sufficiently large n. Using algebra and properties of inequality on the expression 
\(Sn + tn) — (s + #)| we extract out |s, — s| and |t, — t| as follows: 


l(Sn + tn) — (s + #)| = |(sn — 5) + (tn — #)| by algebra 
< |Sn — 8| + |tn —¢| by triangle inequality. 


So, if |s, — s| < § and |t, —t| < §, then |(s, + t,) — (s+ t)| <e. Since lim Sn = 8, 
n CO 

there is a natural number N, such that |s, — s| < 5 when n > Ng. Similarly, there 

is a N; € N such that |t, —t| < § when n > N;. So we will let N = max{Ng, N;}, 

which will ensure that n > N, and n > N;, when n > N. We now present a valid 

proof that is guided by proof strategy 3.2.1. 


Proof. Suppose lim s, = s and lim t, =t. To prove that lim (s, +t,) =s5+t, let 
N—- Oo N—- Oo N—- Oo 


e > 0. Since Jim, Sn = 8, there is an N, € N such that 

[Sn — 8| < 5 for all n > Ns. (3.4) 
Since Jim, t, = t, there is an N; € N such that 

ltn — t| < 5 for alln > Ni. (3.5) 


Let N = max{N,, N;}. Thus, for n > N we have that 


l(Sn + tn) — (s+ #)| = |(sn — 5) + (tn — 8)| _ by algebra 
< |Sn — 8| + |tr —¢| by triangle inequality 


& & 
< 3 + 3 by (3.4) and (3.5) 
=€E by algebra. 


Therefore, |(S, + tr) — (s+ t)| < e for alln > N. 


In the proof of Theorem 3.2.2 we were able to “cleanly” extract out |s, — s| and 
|t, —t|, that is, there were no additional factors. This may not be the case when 
applying strategy 3.2.1 to prove other such theorems. There will be times, after 
extracting out |s, — s| or |t, —t|, that “unwanted” factors involving n, sp, or ty, will 
emerge. When such factors do appear, we will have to find an appropriate upper 
bound for these factors. This is done in the proofs of Theorems 3.2.3 and 3.2.6 below. 
In our proof analysis of Theorem 3.2.3 we obtain the unwanted factor |t,|. As the 
sequence (t,,) converges, there is a K > 0 such that |t,| < A for all n > 1. So we will 
have a desired upper bound. Actually, in our proof analysis, we tacitly use the value 
M = max{K,|s|} for the upper bound because it simplifies the proof slightly. 


Theorem 3.2.3. If lim s, =s and lim t, =t, then lim (s,t,) = st. 
nN—- Oo N—-0o noo 
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Proof Analysis. Assume that im, Sn = 8 and Jim, t, = t. Since (t,) converges, we 
know by Theorem 3.1.24 that t, 2, is bounded. Thus, there is an M > 0 such that 


|s| < M and |t,| < M for all n > 1. (3.6) 


We now use algebra and properties of inequality on the expression |s,t, — st| to 
extract out |s, — s| and |t, — t| as follows: 


|Sntyn — st| = |Spty — stn + sty — st| by algebra 
= |tn(Sn — 8) + S(tn —t)| by algebra 
< |tn(Sn — s)|+|s(tr —#)| by triangle inequality 
= |tn||sn — s| +|s||t, —t| by property of | | 
<M|sn—s|+M|tn—t| _ by (3.6). 


Thus, if |s, — s| < 347 and |t, — t| < 347, then we can conclude that |s,t, — st| < e. 
One can now compose a correct proof using this analysis as a guide. 


Proof. See Exercise 10. 


Our next lemma shows that if the limit of a sequence is nonzero, then the terms 
of the sequence are eventually nonzero as well. 


Lemma 3.2.4. If lim t, =t and t £ 0, then there is a real number 6 > 0 and an 
N EN such that 6 < |t| and 6 < |t,,| for alln > N. 


Proof. Assume that Jim t, =tandt #4 0. Let 6 = Be Clearly, 0 < 6 < |t|. Now, 
because Jim i = There is an N € N such that |t cof | < 6 for alln > N. Let 
n>WN. Hanes, \t| — |tn| < |t — ty| < 6, that is, |¢] — |t,| < 6. Thus, |é| — 6 < |t,| and 


so, |t| — ui < |tp|. As |é] — fl — = Hl = 6, we see that 6 < |t,|. 


The following lemma shows that if all the terms of a convergent sequence are 
nonzero and the limit is also nonzero, then there is a single positive number 7 smaller 
than the absolute values of the limit and all the terms of the sequence. 


Lemma 3.2.5. If lim t, = t where t £ 0 and t, #0 for all n > 1, then there is a 
y > 0 such that y < |t| and y < |t,,| for all n > 1. 


Proof. Assume that lim t, = t,t £0, and t, #0 for all n € N. By Lemma 3.2.4 


there is a 6 > 0 and an N € N such that 6 < |t| and 6 < |t,| for alln > N. Let 
y = min{|t,|, |to|,..., |éw|, d}. Clearly, y > 0, 7 < |t|, and y < |t,| for all n > 1. 


Our next theorem shows that if a convergent sequence (t,,) satisfies the conditions 


of Lemma 3.2.5, then the reciprocal sequence + also converges. 


Theorem 3.2.6. Let ¢ 40 and t, 4 0 for all n > 1. If lim tn = t, then lim eid, 


wrth ot 
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Proof Analysis. Assume that Jim. t, =t,t £0,andt, 4 0 for all n € N. Lemma 3.2.5 
implies that there is a y > 0 oul that (x) y < |t| and (xx) 7 < |t,| for all n > 1. 
We can now use algebra and properties of inequality on the expression Fs — 1| to 


extract out |t, — t| as follows: 


: : | — : tn by algebra 
ty ob tnt 
= ee by property of | | 
[tn | {el 
ltn = ¢| 


< 7 by («) and (xx). 


Thus, if we have that |t, — t| < ye, we can conclude that Fr — | < €. Guided by 
this analysis and proof strategy 3.2.1, we can now present a logically correct proof. 


Proof. Assume that t 4 0, tn, 4 0 for all n > 1, and lim ty, = t. By Lemma 3.2.5, 
there is a real number y > 0 such that y < |t| and y < |t,| for all n > 1. Thus, 


7? < |tnt| for all n > 1. (3.7) 
To prove that lim ~ = ‘, let ¢ > 0. Since lim tn =t, there is an N € N such that 
ltn —t| < ey” for alln > N. (3.8) 


Thus, for n > N we have that 


1 1] |t-t, rare” 
fo a ae ry algebra 
_ |t-thl 
a by property of absolute value 
|tn| || 
lt 7 tn| 
< 5 by (3.7) 


2 
Ey 


Therefore, 


+ — | <e for alln > N. This completes the proof of the theorem. 


Theorem 3.2.7. If Jim, $n = sand Jim. tn =twheret ¥ 0 andt, £0 for all n > 1, 


then Jim nf = 7. 


Proof. Assume that lim Sp = s and lim t, =t where t 4 0 andt, 40 for alln > 1. 


Since . = sn(#), Theorems 3.2.6 and 3.2.3 imply the desired conclusion. 
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3.2.2 The Squeeze Theorem 


Suppose that two convergent sequences have the same limit and the terms of a third 
sequence are eventually between the corresponding terms of the convergent sequences. 
We now show that this third sequence converges to the same limit as well. 


Theorem 3.2.8 (Squeeze Theorem). Let (s,,) and (t,) be convergent sequences such 
that lim Sp, = € and lim tn = €. If (yn) is a sequence satisfying 5, < Yn < tp for all 
n >m where m €N, then lim Unb 


Proof. Let (Sn), (Yn), (tn), £, m be as stated. Let « > 0. Since lim Sn = ¢, there is 


an N, € N such that 
|s, —£| < e€ for alln > Ng. (3.9) 


Since lim t, = @, there is an N; € N such that 
noo 


lt, —£| < efor alln > M,. (3.10) 


Let N = max{m, N,, Nz}. Let n > N. We now prove that |y, — ¢| < ¢. Since n > m, 
we conclude that sp) < Yn < ty. Thus, s, — 0 < yn, —& <t, — €. By Lemma 2.2.8, we 
infer that |y, — @| < max{|s, — 4], |tn — £|}. Since n > N, and n > M;, from (3.9) 
and (3.10) we see that max{|sp — |, |tn — £|} < €. Hence, |yn — ¢| < e. 


Theorem 3.2.8 will allow us to establish the limit of two important sequences. 
First, for c > 0, it is not obvious that the sequence (%/c) = (c, Wc, Wc, Wc,..-) 
converges, and if it does, then one could surmise that the limit depends on c. Our 
next result shows that the sequence does converge and that its limit is independent 


of c. Recall that */¢ = cn, for alln EN. 


Corollary 3.2.9. Let c > 0. Then lim en =1. 


Proof. First suppose that c > 1. Let d > 0 be such that 1+ d=c and let n > 1 be 
any natural number. Since d > 0, we see that d > —1. By Bernoulli’s inequality (see 
Exercise 5 on page 27), we have that 


d\” d 
1+-] >ltn-H=l+d=cPl. 
n n 


on 


Hence, (1+ 4) > cm > 1, for all n > 1. Since lim (1+ 4) =1 and lim do=a1, 
Theorem 3.2.8 implies that lim cn = 1. Now assume that 0 <c <1. So i > 1. The 


previous argument shows that lim + = 1. Thus, lim cn =1 by Theorem 3.2.6. 
NO cn Noo 


We know that the carne ) = (1,2,3,4,...) diverges. So, one would suspect 
that the sequence (%/n) = (1,72 73, V4, .. 2 also diverges. However, it converges. 


Corollary 3.2.10. lim nn =1. 
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Proof. Define the sequence (b,) by by = nn — 1, for alln € N. As (ni) = (bn + 1), 
to prove that lim nn = 1, it is sufficient to show that lim b, = 0 by Theorem 3.2.2. 


Since nz > 1, we see that b, > 0 for each n € N. Let n € N. Since te = bn + 1, we 
have n = (1+ 6,)”. Thus, by Corollary 1.4.12, for all n > 2 


—1 
=(1+0,)" > 14+ Dap 


Thus, 2 > 2 and ¥ > by > 0 for all n > 2. Since dn teed le lim 0 = 0, 


Theorem 3.2.8 implies that lim by, = 0. Hence, lim nn = = 1, by Theorem 3.2.2. 


We now present another “squeeze” result that will be applied later in the book. 


Lemma 3.2.11. Suppose that lim In = c. If a sequence (y,) satisfies |rn — Yn| < 4 
nr CO 
for alln > 1, then lim yp, =. 
noo 


Proof. See Exercise 4. 


3.2.3 Order Limit Theorems 


We next consider how an inequality that is satisfied by every term in a convergent 
sequence affects the limit of the sequence. We present several proofs in this section 
that show that the limit operation preserves the inequality relation <. Moreover, 
these proofs only involve the definition of convergence and some standard properties 
of inequality. 


Theorem 3.2.12. If lim tn = t where t, > 0 for all n EN, then t > 0. 


Proof. Suppose lim tr = t and (A) t, > 0 for all n € N. We will prove that t > 0. 
n CO 
Suppose, for a contradiction, that t < 0. Let « = —t > 0. Since lim tn = t, there is 
n CO 


an N € N such that |t, —¢| < ¢ for alln > N. Letn > N. Sot, —t < |t, —#| <e. 
Thus, t, —t < ¢ = —t. Hence, t, <t—t =0 which contradicts (A). 


Theorem 3.2.12 asserts that if a sequence with nonnegative terms converges, then 
its limit is also nonnegative. This fact implies the remaining results of this section. 


Theorem 3.2.13. If lim Sn = s and lim tn = t where s, < ty, for all n EN, then 
8<t. 


Proof. Suppose that lim Sn = §, lim t, = t and that s, < t, for all n € N. Hence, 
0 < (tn — Sn) for all n € N. Exercise 1 implies that lim (tra — Sn) = t—s. Thus, 
0 <t—s by Theorem 3.2.12. Therefore, s < tf. 


Theorem 3.2.14. Suppose that lim Sn = c. Let a and b be real numbers. 


1. If s, < b for alln EN, then c < 8. 
2. Ifa < s, for alln EN, thena<c. 
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Proof. Suppose lim Sn = c. To prove item 1, suppose that s, < 6} for all n € N. Since 
nr CO 


the constant sequence (b) converges to 6 and s, < b for all n € N, Theorem 3.2.13 
implies that c < b. A similar argument establishes item 2. 


Corollary 3.2.15. Let (s,,) be a sequence whose terms are all in [a, }]. If lim S=— G 


then c is in [a, 0]. 


Corollary 3.2.16. Let (s,,) be a sequence such that lim Sn = cand let M > 0. If 
|Sn| <M for alln EN, then |c| < M. 


Proof. If |s,| << M for alln € N, then —M < s, < M for all n € N. Theorem 3.2.14 
implies that —M <c< M, that is, |c| < M. 


It is important to realize that strict inequalities are not necessarily preserved 
under the limit operation. For example, 4 > 0 for alln EN, but lim 1 = 0. 
noo 


Exercises 3.2 
*1. Suppose lim s, = sand lim t, = t. Using proof strategy 3.2.1 as a guide, prove 
Noo N—- Oo 
that lim (s, — t,) = s—t. 
Noo 
2. Suppose lim s, = s and lim ¢t, = t. Let a,b € R be nonzero. Using proof 
N—- Oo Noo 


strategy 3.2.1 as a guide, prove that lim (asp, + btn) = as + Ot. 
nr CO 


3. Let (s,) and (t,) be sequences. Suppose that (s,) converges and 7 > 0 is such 


Sn 


a is bounded. 


that y < |t,| for all n > 1. Prove that the sequence ( 


“4. Prove Lemma 3.2.11. (Hint: |y, — cl = |Yn — tn + Ln — el.) 
5. Let lim (an + bp) = € and lim (an, — bp) = m for €,m € R. Use Theorem 3.2.2 


and Exercise 12 on page 67 to show that (a,) converges and evaluate its limit. 

6. Given that lim (Gn +b,) = @ and lim (Gn — bn) = m for £,m € R, use Exercise 1 
and Exercise 12 on page 67 to prove that (b,,) converges and evaluate its limit. 

7. Let (a,) be a bounded sequence and suppose that lim b, = 0. Prove that 
lim a,b, = 0. 
n—-0o 

8. Let 0 < r < 1. Show that lim Vil+r” = 1. Conclude that if « > 1, then 
lim Vl+a" =a. 

9. Let y € Rand Kk EN. Suppose that lim Sn = fand s, > y foralln > K. 
Prove that ¢> 4. 

*10. Prove Theorem 3.2.3. 


11. Let (an) and (b,) be sequences, and let @ € R. Suppose that an < ¢ < bp for all 
n> 1, and lim (dn — by) = 0. Prove that lim Gyn = € and lim b, = &. 
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n 


12. Let k € N. Show that Jim, (att) 1, 
13. Prove that jim (n+ ln =1. 

14. Prove that jim (n —l)r=1. 

15. Prove that dim (n? —l)r=1. 


16. Suppose lim Gn = € > p. Prove that there is a k € N such that a, > p for all 
n> k. 


17. Let (a,) be a sequence of positive terms such that Jim, ae > 1. Let pe R be 
such that Jim, er ped: 
(a) Show that there exists a k € N such that an41 > pay for all n > k. (Use 
Exercise 16.) 
(b) Prove, by induction, that apn > pax for all n > 1. 
(c) Conclude that (a,) is unbounded. 


18. Using Exercise 17 and Theorem 3.4.8, show that the sequence (=) is unbounded. 


n! 


19. Suppose that lim Sn = €. Prove that lim 4 So sp =F. 


Exercise Notes: For Exercise 8, 1 < 1+r” < (1+r”)” (see Exercise 1 on page 26). 
For Exercise 10, use proof strategy 3.2.1 and the proof analysis on page 70. For 
Exercise 17, use Exercise 24 on page 67. For Exercise 19, let Nj be so that |s, — ¢| < § 


N. 
when n > Nj, and let N > Nj be such that 4 3 ls, —€| < §, when n > N. 
k=1 


3.3 SUBSEQUENCES 


Given a sequence, we can create a new sequence, called a subsequence, by choosing 
certain terms of the given sequence and retaining the same order as in the original 
sequence. We now present a more formal definition of a subsequence. 


Definition 3.3.1. Let (s,)°2, be a sequence. Let ny < ng < n3 < +++ < mp <-: 
be a strictly increasing infinite sequence of natural numbers. The sequence (sp,)724 
is a subsequence of (5,,). 


Example. Consider the sequence (s,) = (51, $2, §3,...). Let ny = 2, no = 5, n3 = 6, 
na = 9, and ng < m5 < +++ < mg < +--+ continue to be a strictly increasing infinite 
sequence of natural numbers. We can now list the sequence (s,,) and the subsequence 
(Sn,) = (Sny3 Sno; Sng,---) as follows: 


S1,; $2, $3, §4, S5, 56, S57, S88, S9, 


Sn p] Sno ’ Sng p) Sn4 9 
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Example. For the sequence (+) = (1,5,4,4,---), let m1 < ng <+++< mp <-++ be 


the strictly increasing infinite sequence of natural numbers defined by nz = 2k. Thus, 


ny = 2, n2 = 4, nz = 6, and so on. Thus, sn, = s2 = 5; Sno = 84 = 7 Sn3 = 56 = 
1 


and so on. The resulting subsequence (sp,)?21 is (5, ‘, a ay 


1 
6? 


Remark 3.3.2. One can give an alternative definition of a subsequence. A function 
a: N > N is strictly increasing if and only if for all m,n € N, if m < n, then 
a(m) < o(n). Let (sp) be a sequence and let a: N > N be a strictly increasing 
function. The range of o produces the strictly increasing infinite sequence of natural 
numbers o(1) < a(2) < --- < a(n) < ---. Thus, (s,(n)) is a subsequence of (sp). 
Moreover, if (S,,) is a subsequence of (s,) as in Definition 3.3.1, then the function 
o: NN defined by o(k) = nx is strictly increasing and (sp,,) = (89(%)). In addition, 
(So(k)) = (So(n)); that is, (So(a))RL1 = (So(n)) met: 


Lemma 3.3.3. Let ny < no < ng <+++< np <--- bea strictly increasing sequence 
of natural numbers; that is, n, < ng41 for all & € N. Then for all k EN, & < ng. 


Theorem 3.3.4. If the sequence (s,,) converges to s, then every subsequence of (s,,) 
also converges to s. 


Proof. Suppose that lim Sn = 8 and let (s,,) be a subsequence of (s,,). To prove 
n CO 


that lim sp, = s, let ¢ > 0. Since lim s, = s, there is an N € N such that 
k-oo n—-+00 


|S, — 8| <€ foralln > N. (3.11) 


Let k > N. By Lemma 3.3.3, ng > k > N. Thus, (3.11) implies that |s,, — s| < e. 


The above theorem offers two methods for showing that a sequence diverges. 
Corollary 3.3.5. If a subsequence of (s,,) diverges, then (s,) diverges. 


Example. Consider the sequence (s,) = (1,2,1,3,1,4,1,5,1,6,...). This sequence 
has the divergent subsequence (2,3,4,5,6,...). Thus, (s,) diverges. 


Corollary 3.3.6. If a subsequence of (s,,) converges to x and another subsequence 
of (s,) converges to y where x # y, then the sequence (s,,) diverges. 


Example. Consider sequence (s,,) = (1, s, 1, § alk ‘, 1, : iv é ...). This sequence has 
the constant subsequence (1,1,1,1,1,...) that converges to 1, and the subsequence 
(5, 3 ‘, z ...) that converges to 0. Thus, (s,,) diverges. 


The above example shows that if a sequence (s,,) has a convergent subsequence, 
then one cannot conclude that (s,,) converges. However, there is a condition on such 
a subsequence which does imply that (s,) converges. 


Lemma 3.3.7. Let (s5,,) be a subsequence of (s,,) that converges to a real number . 
Suppose that for allk ¢e Nandn EN, ifng <n < ngyi, then either sp, < sp < Sn,4, 
OF Snz4, < Sn < Sn,. Then (sp) converges to £. 
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Proof. Let (Sy,) and (s,) be as stated. Let ¢ > 0. Since (s,,) converges @, there exists 
a Ky € N such that 
|Sn, —€|<eforallk> K. (3.12) 


Let n > nx. Now let k > K be such that ng <n < neq. So either sp, < 5p < Sn,,, 

OF Sn, < Sn < Sn,- Since K+ 1 > k > K, (3.12) implies that |sp, — ¢| < ¢ and 

[Sine - | < ¢. Lemma 2.2.11 implies that |s,, — ¢| < ¢. Hence, lim Sy = &. 
n CO 


If a sequence (s,,) does not converge to a real number @, then there is an ¢ > 0 
such that all N € N there is an n > N where |s,, — ¢| > € (see Remark 3.1.14). 
In fact, the following proposition shows that there is a subsequence (s,,) such that 
|Sn, — | > e for all of the terms in (s,,). This result can be used to show that if a 
bounded sequence does not converge, then it has two subsequences that converge to 
different limits (see Exercise 2 on page 86). 


Proposition 3.3.8. Suppose that the sequence (s,,) does not converge to the real 
number ¢. Then there is an ¢ > 0 and a subsequence (s,,,) such that |s,, — ¢| > € for 
all k > 1. Hence, no subsequence of (s,,) can converge to &. 


Proof. Suppose that (s,) does not converge to ¢. Thus, there is an ¢ > 0 such that 
for all N EN, there isan n > N such that |s,, — ¢| > ¢. For N = 1 let ny > 1 be such 
that |s,, — | > ¢; for N = nj, let no > n1 be such that |s,,, — ¢| > ¢; continuing with 
this method, we obtain a subsequence (s,,,) such that |s,, —¢| > for all k > 1. It 
follows that no subsequence of (s,,) can converge to ¢ (see Exercise 11). 


In the next proposition, we dovetail two sequences together to obtain a new 
sequence and provide a condition under which the new sequence will converge. 


Proposition 3.3.9. Let (s,), (tn) be sequences. Then (s,,) and (t,) both converge 
to @ if and only if the sequence (51, t1, 52, t2,-.-,Sn,tn,-.-) converges to &. 


Proof. Let (sp) and (tp) be sequences. 

(=). Assume that (s,,) and (t,) converge to ¢. Let ¢ > 0. Since (s,,) converges to @, 
there is an N; € N such that |s,, — ¢| < € for alln > Ny. As (tn) converges to @, there 
is an Nj € N such that |t, — ¢| < € for all n > No. Let N = max{N,, No}. For all 


n > N, we have that |s, — €| < € and |t, — ¢| < ¢. Thus, (81, t1, $2, te,...,8n,tn,---) 
converges to @ (see Exercise 13). 
(<=). If (51, t1, 52, t2,.--,Sn,tn,-.-) converges to @, then Theorem 3.3.4 implies 


that (s,) and (t,) also converge to @. 


With respect to convergence, the terms at the beginning of a sequence are of little 
importance. This observation motivates our next definition. 


Definition 3.3.10. Let (s,,)?2, be a sequence and let m > 0 be an integer. The 
subsequence (Sm4n)71 = (Sn)Pm41 is called a tail-end of the sequence (s,). 

age ie 
EYE STADE 


that the subsequence (343) = (9,2) % 7)---) is a tail-end of the sequence (+). 


Example. Consider the sequence (+) = (1 .). Let m = 3. Then we see 
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Since the convergence of a sequence does not depend on the terms at the beginning 


of the sequence, we have the following equivalence. 


Corollary 3.3.11. Let (s,) be a sequence and let m € N. Then (s,,) converges to @ 
if and only if (sm4n) converges to @. 


Proof. Let m € N. Assume that (s,) converges to @. Since (5m4+4n) is a subsequence of 
(Sn), Theorem 3.3.4 implies that (sm4) converges to ¢. The converse follows directly 


from Definition 3.1.1. 


Clearly, every tail-end of a sequence is a subsequence, but not every subsequence 


is a tail-end. 


Exercises 3.3 


1. 
2. 


Prove Lemma 3.3.3 by induction on k. 


Give an example of a sequence that is not bounded above but has a convergent 
subsequence. 


. Give an example of a sequence that is not bounded below but has a convergent 


subsequence. 


. Give an example of a bounded sequence that does not converge. 


5. Suppose that (s,,) is a sequence that is not bounded above. Show that there is a 


"11. 


12. 
*13. 


) 
subsequence (5y,)2, such that s,, >k for all k EN. 
) 


. Suppose that (s,,) is a sequence that is not bounded below. Show that there is a 


subsequence (5,,)?21 such that s,, <—k for allk EN. 


. Suppose that a sequence (s,,) converges to s € R and a sequence (t,) converges 


to t € R, where s # t. Explain why the sequence (51, t, 52, t2,...,Sn,tn,---) 
does not converge. 


. Let (s,,) be a sequence and let m € N. Suppose that the tail-end sequence (Sm+4n) 


converges to s. Prove that (s,,) converges to s. 


. Suppose that (s,,) has a subsequence that converges to x € R and another sub- 


sequence that converges to y € R, where x # y. Explain why the sequence (s,,) 
does not converge. 


. Suppose that the sequence (s,,) is bounded above and is also bounded below. 


Show that there is a real number K > 0 such that |s,| < A for alln EN. 


Let ¢ > 0 and let @ € R. Suppose that a sequence (s,,) satisfies |s, — ¢| > € for 
all n > 1. Prove that no subsequence of (s,,) can converge to @. 


Suppose lim s, =c. Let a: NN be one-to-one. Prove that lim s,(,) = c. 
noo n-oco 
Suppose that lim s, = @and lim t, = ¢. Define the sequence (un) by 
noo noo 


Sni1, if nis odd; 
2 


t 


Un = 
if n is even. 


a2 
2 d 


Sequences m 79 


Using Definition 3.1.1, prove that lim Un = ¢. 


*14. Let (s,) be a sequence. Prove that (s,) converges to @ if and only if the two 
subsequences (82) and (S21) converge to @. 


15. Suppose lim Gn = € < p. Prove that there is a k € N such that a, < p for all 
n>k. 


*16. Let (a,) be a sequence of positive terms such that lim ont < 1. Let p € R be 
such that lim See pe, 
(a) Show that there exists a k € N such that an41 < pay, for all n > k. (Use 
Exercise 15.) 
(b) Prove, by induction, that ap, < pax for all n > 1. 


(c) Conclude that lim dg =O: 


17. Let 0 < |r| <1landk €N. Use Exercise 16 to show that the sequences converge 
to 0. 


(a) (27) 
(b) (2) 


(c) (nr). 
18. Leto: NN and 7: NN be strictly increasing. Prove that (007): NN 


is strictly increasing. Let (s,,) be a sequence. Using Remark 3.3.2, conclude that 
(a) (So(7(n))) is a subsequence of (s,,), and (b) (S5(7(n))) is a subsequence of (5,5(n))- 


Exercise Notes: For Exercise 5, Proposition 3.1.20 allows one to construct the 
required subsequence as follows: For k = 1 let n, be such that s,, > 1; for k = 2 let 
n2 > ny be such that s,, > 2; etc. For 5(b), Proposition 3.1.22 allows one to construct 
the desired subsequence as well. For Exercise 12, observe that for any N € N, the set 
{n © N: o(n) < N} is finite. This exercise implies that if a sequence (s,) converges 
to c, then any sequence obtained by reordering the terms in a subsequence of (s,)) 
also converges to c. For Exercises 13 and 14, given the appropriate N, and No, 
let N = max{2N, — 1,2N2}. Note: n is odd iff n = 2k — 1 for some k € N. For 
Exercise 17(c), see Corollary 3.1.12. For Exercise 18(b), let (tn) = (So(n)), where 
tn = So(n) for alln € N. 


3.4 MONOTONE SEQUENCES 


A sequence is increasing if each term is less than or equal to the term after it. A 
sequence is decreasing if each term is greater than or equal to the term after it. 


Definition 3.4.1. A sequence (s,) is an increasing sequence if for all n € N, we 
have that 5, < S,41. A sequence (s,,) is a decreasing sequence if for all n € N, we 
have that s, > 5,41. A sequence is monotone if it is either increasing or decreasing. 
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Thus, if sequence (s,,) is increasing, then s, < s,, whenever n < m, and if (s,) is 
decreasing, then s, > S, when n <m. 
Example. The sequences (1 — 1), (3n), and (1,1,2,2,3,3,...) are increasing. The 
i (—1)" 


n n 


sequence (+) is decreasing, and the sequences ((—1)”) and ( ) are not monotone. 


An increasing sequence that is not bounded above does not converge; for example, 
the sequence (3n) diverges. However, if a sequence is increasing and bounded above, 
then the terms of the sequence eventually cluster together with the supremum of the 
set of these terms (see Figure 3.4). 


Lemma 3.4.2. If (s,,) is increasing, bounded above, and 3 = sup{s, :n € N}, then 
lim s, = B. 

noo 

Proof. Assume that (s,,) is increasing and bounded above. Let S = {s, :n € N} and 
6 = sup(S). Thus, s, < 6 for all n € N. To prove that lim Sn = 8B, let ¢ > 0. As 


B—e < , there isan N € N such that G—e < sy. Thus, (A) G—sy <e. Letn > N. 
So sy < S, and (@) 6 — 8s, < B— sy. We now prove that |s, — 6| < € as follows: 


[Sn — B| = 8 - Sn because s, < 8 
<6B-—sy <e_ by (A) and (@). 


upper bound 


$1 $2 83 $455 +: f 


Figure 3.4: Representation of the proof of Lemma 3.4.2, where S = {s, :n € N}. 


Corollary 3.4.3. If (s,) is increasing and lim Syn = §, then s, < s for alln > 1. 


Proof. Suppose that (s,) is increasing and lim Sn = s. Then s = sup{s, : n € N} 
by Lemma 3.4.2. Thus, s, < s for all n > 1. 


Our next lemma (see Exercise 4) shows that if a sequence is decreasing and 
bounded below, then the limit of the sequence is the infimum of the set of its terms. 


Lemma 3.4.4. If (s,) is decreasing, bounded below, and a = inf{s, :n € N}, then 
lim Sn = Q. 


Corollary 3.4.5. If (sp) is decreasing and lim Sn = 8, then s, > s for all n > 1. 


We can now show that a bounded monotone sequence converges. 
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Theorem 3.4.6 (Monotone Convergence Theorem). Suppose that (s,,) isa monotone 
sequence. Then (s,,) is convergent if and only if (s,,) is bounded. 


Proof. Let (s,) be a monotone sequence. If (s,) is convergent, then Theorem 3.1.24 
implies that the sequence is bounded. If (s,) is bounded, then either Lemma 3.4.2 or 
Lemma 3.4.4 implies (s,) is convergent. 


Problem 3.4.7. Consider the sequence (s,) which is inductively defined by s; = 2 
and (A) Sn41 = V64 5, for all n > 1. Prove by induction that the sequence is 
monotone and bounded. Using the Monotone Convergence Theorem show that the 
sequence (s,,) converges and then evaluate its limit. 


Solution. We begin by proving the following proposition. 
Proposition. For every natural number n > 1, 0 < 5y) < Sn441 < 10. 


Proof. We use mathematical induction. 


Base step: For n = 1, we have that s; = 2 and sp = 6+ 5; = V8 = 2vV2. Thus, 
0< sy <5 < 10. 


Inductive step: Let n > 1 and assume the induction hypothesis that 
0 < Sy < Spi < 10. (1H) 


We prove that 0 < Sn41 < Sn42 < 10. Note that (A) implies s,4; = 6+ 5, and 


Sn42 = V6 F+ Sn41. Thus, 
0 < Sy < Spi < 10 by (IH) 
0<64+ 8, <64+ Sn41 <6+4 10 by prop. of inequality 
0< V6+ sn < V64 Sn41 < V16_ by prop. of inequality 
O0< Sn41 =< Sn42 < 4 by (a). 


Hence, 0 < Sn41 < Sn42 < 10. This completes the proof of the proposition. 


Thus, (s,,) is monotone and bounded and so, lim Sn = 8 exists by Theorem 3.4.6. 
Since Sn41 = 6+ s,, we conclude that (¥) sy =6+s,. Moreover, lim Snt1 = § 
by Theorem 3.3.4. Hence, lim s?,, = 8” by Theorem 3.2.3. Thus, 


2 ; 2 : 
s* = lim snyi = jim (6 +8) by (¥) 
=6+ lim s, _ by Exercise 11 on page 67 
noo 
=6+5 because lim s, = s. 
noo 
Therefore, s? — 5s —6 = 0. The roots of this equation are s = —2,3. Since s,, > 0 for 


all n > 1, Theorem 3.2.12 implies that s = 3. So, lim Sn = 3. 
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Consider the sequence ((1 + 1)n), called Euler’s sequence. Does it converge? The 
answer to this question is not at all obvious. In fact, the sequence does converge and 
this will be verified in the following proof by appealing to the Monotone Convergence 
Theorem 3.4.6, a very important theoretical tool. 


Theorem 3.4.8. The sequence ((1 + 4)n) converges to a real number ¢. Moreover, 
2<£<3. 


Proof. We shall show that ((1 + 4)n) is increasing, bounded above by 3, and bounded 
below by 2. By Theorem 1.4.11, for all natural numbers m > 3, we have that 


Oa) ETE (Dae 


(8 (a) 


One can check that (a) 2 = (1+ ¢)! < (1+ 5)? < (14+ 4)®. We will now show that 
(1+ 4)p< (1+ a ee for all n > 3, as follows: 


(3.13) 


1\” ™[n\ 1 
(ee) Set a by (3.13 
(et) re ‘au 
ea 1 2 k-1 
=2+55 (1 )( }--(- ) by Exercise 8 
pa n n n 
re il 1 2 k-1 
<2 1 1 --(1l— . Prin. 3.1.3 
Sra —) ( —;) ( —) wipes 
"f{n+1 1 
= 243>( ; by Exercise 8 
rae ie pe) 
" f{n+1 1 1 1 
<2+(50 eee S| 
k n n 
XL Jee) > Gey @rA 
n+1 
n+1 1 
—i[+1+ ( Feesre by (3.13) 
jen Ne tee) 
1 n+l 
al We by (3.13). 
(1+) y (3.13) 


Therefore, (1 + i)n < (1+ Sea Hence, the sequence is increasing. Moreover, 
(A), (3.13), and Exercise 8 imply that for n > 3, 


ee ey ok “(n\ 1 “1 
2<(1+=) = 3 ({)an2+d (fla st+ do ger <3 


k=0 =2 k=2 


(see Theorem 1.4.7 and the Shift Rule 1.4.8). Theorems 3.4.6 and 3.2.14 now imply 
that ((1+ +)”) converges to a limit € where 2 < £ <3. 
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3.4.1. The Monotone Subsequence Theorem 


In this section we shall prove that every sequence of real numbers has a monotone 
subsequence. First we define what it means for a term in a sequence to be a “peak.” 


Definition 3.4.9. Let (s;,) be a sequence. Let s,, be the mth term of this sequence. 
We say that s,, is a peak if s,, > s, for all n > m (see Figure 3.5). 


Let 5m be the mth term of the sequence (s,). Then 5s, is a peak if it is strictly 
greater than every term that follows it. On the other hand, s,, is not a peak if s,, is 
less than or equal to some subsequent term; that is, 5m < s, for some n > m. 


° 
) i e 
‘ ° eo e | eo —— —— 
: @ ee e : 
ee : e ‘ 
’ ee oe i © sents. 2 © inant 
. a . 
Prrrrrrrrrerere eter fi rT 
1234567891011 .-.---- n n+l 


Figure 3.5: 57, S14, and s, are peaks; s4 and sg are not a peaks. 


Example. Consider the sequence (s;,) = (". So, (Sn) = (1,5, 7 i, é ) 
Thus, s4 = F is a peak and ss = —; is not a peak because s5 = —: < 7 = S¢. Let 
P ={8m: 8m is a peak}. Then P = {5,4,%,...} and P is infinite. 


Example. Consider the sequence (s;,) = pp iss ashe 
and s3 = $ is not a peak since s3 = $ < 3 = 85. Let P = {Sm : Sm is a peak}. Then 


P = {2,1} and P is finite. 


Theorem 3.4.10 (Monotone Subsequence Theorem). Every sequence of real num- 
bers has a monotone subsequence. 


Proof. Let (sn) be a sequence of real numbers. Let P = {5 : Sm is a peak}. There 
are two cases to consider: Either P is infinite or P is finite. 


CASE 1: P is infinite. Since P is infinite, we can construct a subsequence of (s,), 
consisting of peaks, as follows: Let m1 be the first index such that sm, is a peak. Let 
mz be the smallest natural number larger than m, such that s,,, is a peak. Since P is 
infinite, we can continue in this manner obtaining m, < mg < m3 <-+-+< mz <--> 
where the subsequence (s,,,) is such that each s,,, is a peak. Thus, as each s,,, is 
a peak, we conclude that sm, > 5m, > Sm; > ++: > Sm, > +: and thus, we have 
constructed a monotone subsequence. 


CASE 2: P is finite. Since P is finite, let P = {5m,, 5m ,---;Sm,} be a finite listing of 
all the peaks. We can construct a subsequence of (s,,), consisting of terms that are not 
peaks, as follows: Let n, be larger than all of natural numbers in {m1,™mz2,...,m,}. 
Thus, sy, is not a peak and that s, is not a peak for all k > n 1. Since sp, is not a 
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peak, there is a natural number ng > n; where 8,, < S,,. Now, since s,, is not a 
peak, there is a natural number n3 > n2 such that s,, < s,,. We can continue in this 


manner obtaining ny < ng < ng < ++: < mp < +--+: where the subsequence (sp,) is 
such that each term s,, is not a peak and sy, < Sn, < Sn, S++: < Sn, <---. Hence, 


we have a monotone subsequence. 


Exercises 3.4 


1. 
2. 


*A, 


Give an example of a sequence that converges and is not monotone. 


Prove that each of the following sequences is monotone and bounded: 


(a) (3) ) (4) © (4). 


. Show that the sequence (n + cy") is increasing and does not converge. 


Prove Lemma 3.4.4. 


5. Prove Corollary 3.4.5. 


*8. 


10. 


11. 


. Inductively define the sequence (s,) by 5; = 1 and spj44, = +(25n + 3) for all 


n > 1. Show that (s,) converges and then find its limit. (See Problem 3.4.7.) 


. Inductively define the sequence (s,) by s; = 1 and s,4, = 3— + for all n > 1. 


Show that (s,,) converges and then find its limit. (See Problem 3.4.7.) 
Let m€ Nand k €N be such that m > k > 2. Using algebra, show that 


pea) ane 


Now conclude that (7) +e < @ < get. 


. Let (s;,) be a monotone sequence with a subsequence (s,,,) that converges to @. 


Prove that (s,) converges to @. 


Let A C R be nonempty and bounded. Let 6 = sup(A). Thus, for each n € N, 
there is a b, € A such that 6 — 4 < bn. By Theorem 3.4.10 the sequence (b,) has 
a monotone subsequence (bn, ). 


(a) Show that lim i= 7 
(b) Prove that lim b,, = 6. 
k-00 
(c) Suppose that 6 ¢ A. Prove that (b,,) must be an increasing sequence. 


Let A C R be nonempty and bounded. Let a = inf(A). Thus, for each n € N, 
there is an a, € A such that a, <a+ 1 By Theorem 3.4.10 the sequence (a,) 
has a monotone subsequence (dn, ). 


(a) Show that lim (rer 
(b) Prove that lim ap, = a. 
k-0o 


(c) Suppose that a ¢ A. Prove that (an,) must be a decreasing sequence. 
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3.5 BOLZANO-WEIERSTRASS THEOREMS 


Bolzano—Weierstrass Theorem for sequences is a fundamental result which states that 
each bounded sequence has a convergent subsequence. This theorem is named after 
the mathematicians Bernard Bolzano and Karl Weierstrass. It was first proved by 
Bolzano, but his proof was lost. It was re-proven by Weierstrass and has become an 
important centerpiece of analysis. 


Theorem 3.5.1 (Bolzano—Weierstrass Theorem for Sequences). If the sequence (s,) 
is bounded, then (s,) has a convergent subsequence. 


Proof. Let (s,) be a bounded sequence. By Theorem 3.4.10, there exists a monotone 
subsequence (S»,). Since (s,) is bounded, it follows that (s,,) is bounded. As (sp,) 
is a bounded monotone sequence, Theorem 3.4.6 implies that (s,,,) converges. 


Given a point  € Randa set S CR, item 1 of our next definition can be viewed 
as addressing the question: What does it mean to say that x is “close” to S? 


Definition 3.5.2. Let S be a subset of R. 


1. A point x € R is an accumulation point of S if every neighborhood of x 
contains an infinite number of points from S; that is, if U* is any neighborhood 
of x, then SMU” is infinite. 


2. A point x € R is an isolated point of S if x € S and z is not an accumulation 
point of S; that is, there is a neighborhood U* of x such that SM U* = {2}. 


A point x € Ris an accumulation point of a set S if for every neighborhood of x 
(no matter how small), there are an infinite number of points from the set S' that are 
in the neighborhood. So if J is an interval and either x € J or x is an endpoint of J, 
then x is an accumulation point of J. 

A point x € S is an isolated point of S if there is a neighborhood of x in which 
there are no other points from the set S (z is all alone; i.e., x is the only point from 
S living in this neighborhood). 


Remark. An accumulation point of S may be in the set S or it may not be in S. 
On the other hand, an isolated point must be in S’. 


Example. For S = [0,3) U {4}, we see that every point x € [0,3] is an accumulation 
point of S whereas, 4 is an isolated point of S. For S = Q, as Q is dense in R, we see 
that every point x € R is an accumulation of S. 


We will next show that a bounded infinite set of real numbers has at least one 
accumulation point. 


Theorem 3.5.3 (Bolzano—Weierstrass Theorem for Sets). Let S C R be infinite. If 
S is bounded, then there is a point x € R such that x is an accumulation point of S’. 
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Proof. Let S C R be infinite and bounded. As S is infinite, there is a sequence (s,,) 
of distinct points in S, that is, s, € S for all n € N. Since S is bounded, the sequence 
(Sn) is also bounded. Theorem 3.5.1 implies that (s,) has a convergent subsequence 
(Sn,). Let x be the limit of this subsequence (s,,). Corollary 3.1.17 implies that every 
neighborhood of x contains an infinite number of points from the subsequence (Sn, ). 
Since each s,, € S, it follows that x is an accumulation point of S. 


Theorem 3.5.4. Let S C R and x be an accumulation point of S. Then there is a 
sequence of distinct points (s,,) in S that converges to x. 


Proof. Let x be an accumulation point of S'. Recall that for any n > 1, a real number 
a is in the neighborhood (x — +, + +) if and only if Ja—2| < +. Now, for each 
n €N, we have that the (x — - o+ 1) contains an infinite number of points in S. 
We now define a sequence with distinct points as follows: For n = 1 choose s, € S 
such that |s; — z| < +. For n = 2 choose an s2 € S so that s2 4 s; and |s2 — 2| < 5. 
For n = 3 choose an s3 € S so that s3 4 81, sg and |s3 — 2| < 5 Continuing in this 
manner, we obtain a sequence (s,) of distinct points in S such that |s, — 2| < + for 


all n > 1. Theorem 3.1.11 implies that the sequence (s,,) converges to x. 


Exercises 3.5 


1. Can you find a sequence (s,) such that 1 < s, <5 for all n > 1 where (s,) has 
no convergent subsequence? 


*2. Let (s,) be a bounded sequence that does not converge. By Theorem 3.5.1 there 
is a subsequence (s,,,) that converges to some real number ¢. Show that there is 
another subsequence of (s,,) that converges to a real number different than @. 


3. Let (s,) be a bounded sequence. Suppose that every convergent subsequence of 
(Sn) converges to the same value ¢. Explain why Exercise 2 implies that (s,) 
must converge. 


4. For each of the following subsets S' of R, find some accumulation points (if any) 
and find some isolated points (if any). 


(a) .S = [0,:3). (c) S={4:neEN}. 
(b) SN, (d) S={qEQ:0<q<l}. 
5. Let I C R be the set of irrational numbers. Find the set of all accumulation 
points of I. 
6. Let S={reEQ:2<]}. 
(a) Show that 1 is an accumulation point of S. 
(b) Define a sequence (x,,) of points in S' that converges to 1. 
7. Let S={aJ/2:2 € Qand z < 1}. 
(a) Show that /2 is an accumulation point of S. 
(b) Define a sequence (x,) of points in S that converges to V2. 
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8. Let S C R. Suppose that (s,,) is a sequence of distinct points in S that converges 
to x. Show that x is an accumulation point of S. 


9. Let (s,) be a sequence satisfying |s, — S| < M for all n,m > 1, where M > 0. 
Prove that (s,) has a convergent subsequence. 


10. Let (s,) be a sequence such that a < s, < b for infinitely many n € N, where a 
and b are real numbers. Show that (s,,) has a convergent subsequence. 


11. Let S be a subset of R, and let x € R. Suppose that every neighborhood of x 
contains a point from S' that is different from x. Prove that every neighborhood 
of x contains an infinite number of points from S' that are all different from 2. 


Exercise Notes: For Exercise 2, Proposition 3.3.8 implies that there is a subsequence 
(Sn,) that does not converge to ¢. This subsequence is bounded. For Exercise 11, let U? 
be a neighborhood of x, where ¢ > 0. Suppose that (U? \{z})NS = {u1, us,..., Un} 
is finite. Let e* = min{|u) — z|,|u2 —2|,...,|Uun — z|} > 0. Show that U2. C U® and 
each u; ¢ U%. Derive a contradiction. 


3.6 CAUCHY SEQUENCES 


One of the problems with deciding if a sequence converges is that we need to have 
a purported limit before we can apply the limit definition. Augustin Cauchy found 
a way around this problem, called the Cauchy Convergence Criterion. A sequence 
that satisfies this criterion is said to be a Cauchy sequence. The formal definition of 
a Cauchy sequence, below, makes no reference to a limit; but, it does require that 
the terms of the sequence become arbitrarily close to each other as the sequence 
progresses. 


Definition 3.6.1. A sequence (s,,) is a Cauchy Sequence if for every ¢ > 0 there 
exists an N € N such that for all m,n EN, if m,n > N, then |s, — 5| < e. 


Proof Strategy 3.6.2. To prove that a sequence (s,) is Cauchy, use the proof 
diagram: 
Let ¢ > 0 be an arbitrary real number. 
Let N = (the natural number you found). 
Let m,n > N be arbitrary natural numbers. 
Prove |Sn — Sm| < €. 


Lemma 3.6.3. Every convergent sequence is a Cauchy sequence. 


Proof. Let (s,) be a convergent sequence. Let lim Sp = 8. To prove that (s,) isa 
noo 


Cauchy sequence, let ¢ > 0. Since lim s, = s, there is an N € N such that 
noo 


for alln EN, ifn > N, then |s, — s| < - (3.14) 
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Now let m,n > N. We shall prove that |s;, — 5m| < ¢ as follows 


|Sn — Sm| = |(Sn — 8) + (S$ —Sm)| by algebra 


= |sn — s| + |s — S| by triangle inequality 
Sa as by (3.14) 


Thus, the sequence (s,) is a Cauchy sequence. 


We know by Theorem 3.1.24 that a convergent sequence is bounded. The same is 
true of a Cauchy sequence. 


Lemma 3.6.4. Every Cauchy sequence is bounded. 


Proof. Let (s,) be a Cauchy sequence. Let ¢ = 1 and N € N be so that |s, — 5| <1 
holds for all m,n > N. Thus, 


[sn] — [8m] < [Sn — $ml < 1 


for allm,n > N. Let mo be any fixed natural number mo > N. Hence, |sn| < [Smo|+1 
for alln > N. Let M = max{|si|,...,|5~|,|Smo| + 1}. We see that |s,| <M for all 
n €N. Therefore, (s,,) is a bounded sequence. 


In the proof of the following theorem, Lemma 3.6.3 is used to conclude that a 
convergent sequence is a Cauchy sequence. Lemma 3.6.4 and the Bolzano—Weierstrass 
Theorem 3.5.1 are then used to prove the converse. 


Theorem 3.6.5 (Cauchy Convergence Criterion). Let (s,,) be a sequence. Then (s,,) 
is convergent if and only if (s,) is a Cauchy sequence. 


Proof. If (sn) is convergent, then Lemma 3.6.3 implies that (s,,) is a Cauchy sequence. 
To prove the converse, assume that (s,,) is a Cauchy sequence. Lemma 3.6.4 implies 
that (s,) is a bounded sequence. Theorem 3.5.1 implies that (s,) has a convergent 
subsequence (s,,). Let x be the limit of this subsequence (s,,). We now prove that 
the sequence (s,,) also converges to x. To do this, let ¢ > 0. Since (s,) is a Cauchy 
sequence, there is an N € N such that 

€ 


for all m,n EN, if m,n > N, then |s, — sm| < 5 


(3.15) 


Let n > N. Because z is the limit of the subsequence (s,,), there is a natural number 
nz > N such that (A) |Sn, — 2| < 5. Therefore, 


|Sn — £| = |(Sn — Sn,) + (Sn, — £)| by algebra 


< |8n — Sn, | + |Sn, — 2| by triangle inequality 
é  € 
< 5 + 37 € by (3.15) and (a). 


Thus, |s, — z| < ¢. Therefore, (s,,) converges. 
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Theorem 3.6.5 allows one to confirm that a sequence converges without knowing 
the value of its limit. Since the proof of Theorem 3.6.5 applies Theorem 3.5.1, the 
Cauchy convergence criterion follows from the completeness axiom. 

The following lemma can be a used to show that certain sequences are Cauchy. 


Lemma 3.6.6. Let (s,,) be a sequence and let a > 0. Suppose 0 < r < 1 is such that 
ISn41 — Sn| < ar” for alln > 1. (3.16) 
Then (s,,) is a Cauchy sequence and hence, converges. 


Proof. Let (sy), a, and r be as stated in the lemma. Corollary 3.1.12 implies that 
lim ar” = a lim r” = 0. By Theorem 3.6.5, (ar”) is a Cauchy sequence. Now let 
noo N—- Oo 

é > 0. Since (ar”) is a Cauchy sequence, there is a natural number N such that 
(A) jar” — ar™| < e(1—1) for all natural numbers m,n > N. Let m,n > N. We can 
assume that m > n. We show that |5m — 5,| < ¢ as follows: 


lSm — Sn| = |(Sn41 — $n) + (Sn42 — Sn41) +++: + (8m — Sm—1)| by algebrat 
< |Sn41 — Sn] + |Sn4a — Sn4il +--+ + |Sm — Sm-1| by triangle inequality 
< ar” part ns part by (3.16) 
=ar"(ltrtr? tect by algebra 
fat 
= ar” | ——_—_ by Theorem 1.4.7 
l-r 
ar” — ar™ 
== by algebra 
l-r 
e(l-r 
< elt) =€ by (a). 
l-r 


Problem. Let (a) be a bounded sequence and let x € R be such that 0 < |a| < 1. 


n 
Consider the sequence (s,) where s, = >> azx*. Prove that (s,) converges. 
k=1 


Proof. We show that (s,,) is a Cauchy sequence by applying Lemma 3.6.6. Since (ax) 
is bounded, there is an M > 0 such that |a,| < M for all k € N. Let n € N. Then 


n+l 


n 
|snt1 — Sn] =]o ana* — S7 aga! = fangaz™ | = lanss| [al”*? < (M |x) fal”. 
k=1 k=1 


So, by Lemma 3.6.6, (s,) is a Cauchy sequence and converges by Theorem 3.6.5. 


In the above proof, upon showing that (s,,) is a Cauchy sequence, Theorem 3.6.5 
allowed us to infer that (s,) converges without knowing its limit! Cauchy sequences 
play an important role in real analysis and they will surface again in the text. 


'For example, s7 — 83 = (sa — 83) + (85 — sa) + (86 — 85) + (87 — 86). 
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Exercises 3.6 


1. Using Definition 3.6.1, prove that (83) is a Cauchy sequence. 
2. Prove that any subsequence of a Cauchy sequence is also a Cauchy sequence. 


3. Let (s,) be a Cauchy sequence. Let (s,,,) be a subsequence that converges to @. 
Prove that (s,,) also converges to @. 


4. Let (s,) be a Cauchy sequence and let k > 0. Let (tn) be a sequence satisfying 
Itn —tm| < k|Sn — 5m| for all n,m > 1. Prove that (t,) is a Cauchy sequence. 
5. Suppose that the sequence (s,) is such that |s, — 8m| << — for all m,n EN. 
(a) Prove that (s,,) is a Cauchy sequence. 
(b) Prove that (s,) is a constant sequence. 


6. Suppose that (s,,) and (t,) are Cauchy sequences. Using Definition 3.6.1, prove 
that (sp, + tn) is a Cauchy sequence. 


7. Suppose that the sequence (s,) satisfies |sn41 — Sn| < for all n > 1. Show 


Gt 
that (s;) is a Cauchy sequence. 


n 
8. Consider the sequence (s,) where s, = py p=l+gt+qt-t+ a. Using 
=1 
Exercise 7 show that the sequence (s,,) converges. 


9. Let (s,) be a sequence where s; 4 sg. Let 0 < r < 1 and suppose that 
[Sn42 — Sn4i] <7 |Sn41 — Sn| for all n > 1. 


Prove the following: 
(a) |Sn41 — 8n| < 7"! |sq — 81| for all n > 1, by induction. 
(b) (sp) is a Cauchy sequence. 


10. Inductively define the sequence (s,,) by 5; =c > 1 and sp41, = we for alln > 1. 
Observe that s,, > 0 for alln > 1. 


(a) Using Exercise 9, prove (s,) is a Cauchy sequence. 


(b) Evaluate lim sy. 
N—- Oo 


y nS m3 _ n43 1+1 “its = n43 1|+ 
1 — tts. For Exercises 2 and 3, use Lemma 3.3.3. For part (b) of Exercise 5, let 
£ be the limit of the sequence. Show that s, = @ for all n, by first showing that 
[Sn le _ [8m — | < |S) —5»| for all m,n € N. For Exercise 7, first show that 


nae < oa for alln > 1. 
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3.7 INFINITE LIMITS 


An unbounded sequence does not converge, but some unbounded sequences “take off” 
to infinity; that is, the terms of the sequence become, progressively, larger and larger 
without bound. In other words, there are unbounded sequences (s,,) whose terms Sp, 
become arbitrarily large as n becomes large. For example, consider the unbounded 
sequences 


(2 BAS 6s: OA AB 6.5.6 Ta ny ad 10,2, 058.0. 4.4 2) 


In the first two sequences, the terms are eventually all getting larger and larger 
without bound, whereas this is not the case for the third sequence. Here is a precise 
definition of this notion. 


Definition 3.7.1. Let (s,,) be a sequence. 


e We say that (s,) diverges to co provided that for every M > 0 there exists an 
N €N such that for all n € N, ifn > N, then s, > M. In this case, we shall 
write lim s,, = oo. 

noo 
e We say that (s,,) diverges to —oo if for every M < 0 there exists an N € N such 


that for alln € N, ifn > N, then s, < M. When this is the case, we will write 


lim s,, = —oo. 
noo 


Problem. Prove that. lim (n? — 3n + 2) =o. 


Proof Analysis. Let M > 0. We must find an N such that n? —3n+2 > M for all 
n> N. Note that n? —3n4+2 > n? —3n =n(n—- 3) > n—3, when n—3 > 0. So we 
need n — 3 > M. Solving for n, we need n > M + 3. Thus, we let N > M+3. We 
can now give a proof. 


Proof. We prove that lim (n? — 3n + 2) = oo. To do this, let M > 0. Let N EN be 
such that N > M+3. Let n > N bea natural number. Then 


n? —3n+2>n?-—3n by property of inequality 
= n(n — 3) by factoring 
>n—-3 because n > 4 
>(M+3)—3  becausen>M+3 
=M by algebra. 


Therefore, n? — 3n +2 > M, for all natural numbers n > N. 


Theorem 3.7.2. Let (s,,) and (t,) be sequences. Suppose that K € N is such that 
Sn < ty for all n > K. Then the following hold: 


(a) If lim sp, =o, then lim t, =o. 
N—- Oo noo 


(b) If lim t, = —oo, then lim s, = —oo. 
n—0oo N00 
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Proof. Let (sp) and (t,) be sequences such that s, < t, for all n > K. To prove (a), 


assume that Jim, Sn = oo. Let M > 0. Since Jim, Sn, = 00, there exists an N’ € N 


such that sy, S .M for alln > N’. Let N = max{N’, K}. As sy < ty for alln > K, 
we conclude that M < sy, < ty, for alln > N. So t, > M for all n > N and thus, 


lim tn = 00. The proof of (b) is similar. 


Theorem 3.7.3. Let (s,,) is a sequence of positive terms. Then lim Sp, = 00 if and 
n>Co 

only if lim + =0. 

N—- Oo 


Sn 


Corollary 3.7.4. Let « > 1. Then lim x” = co. 


noo 


Proof. Let x > 1. Thus, 0 < 4 < 1. Corollary 3.1.12 implies that lim + = 0. Thus, 
Theorem 3.7.3 implies that lim L =00. 


Exercises 3.7 
1. Prove that lim nN = OO. 
. Prove that lim “+! = oo. [Hint: Show that Y%+! > /n when n > 1.] 
noo Vn vn 
. Prove that lim —n+sin(2n) = —oo. 
- Prove that lim n—4/n = oo. [Hint: Show that n — 4,/n > /n when n > 25.] 


. Let c > 0. Suppose that lim s, = oo. Prove that lim cs, = oo. 
noo noo 


. Prove Theorem 3.7.2(b). 


2 
3 
4 
5 
6. Suppose that Jim, Sn = oO and Jim, ty, = co. Prove that jim (sn +ty) = 
7 
8. Prove Theorem 3.7.3. 

9. Let (s,) be bounded below. Prove that if lim tn = 00, then jim (Sn + ty) = 00. 
*10. Let (an) be a sequence of positive terms. Suppose im, ae =o. 


Prove that there exists a K € N such that a,41 > a, for alln > Kk. 


(b) Show that a, > ax for alln > K +1. 
(c) Let c= min{ay, a2,...axK}. Conclude that c > 0 and a, > ¢ for alln EN. 
(d) Show that there is a real number a > 0 such that cn >a for alln EN. 


Let p > 0. Show that there isa N € N such that an41 > (4)@n for alln > N. 
Prove, by induction, that a, > (£)"an for alln > N +1. 
Infer that an > (£)"c for alln > N +1. 


Conclude that lim (a@m)* = 00. 
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3.8 LIMIT SUPERIOR AND LIMIT INFERIOR 


A bounded sequence may not converge, but we can use such a sequence together with 
the supremum and infimum operations to define two sequences that do converge. 


3.8.1 The Limit Superior of a Bounded Sequence 


Let (z,) be a bounded sequence. Thus, the set {x, : k > 1} is bounded. Let a be a 
lower bound and let b be an upper bound for the set {xz, : k > 1}. So a and 6 are, 
respectively, a lower bound and an upper bound for any subset of {a : k > 1}. For 
each n € N, consider the set {x, : k > n}. Clearly, we have that 


eek lS doe eo) epee hoo) Se SS fer ke Sa} (3.17) 


where AD B means that B C A. As {ay : k > 1} is a bounded set, it follows from 
(3.17) that each of the sets {x, : k > n} are bounded. Moreover, (3.17) implies (see 
Exercise 4 on page 46) that 


sup{z,: k >1}>sup{r,:k > 2} >--->sup{ap,:k >n}>---. (3.18) 


Thus, using the bounded sequence (,,), we can construct a decreasing sequence (3,,) 
by defining 

Bn =sup{z, sk = n} 
for all n > 1. Hence, 8; > 82 > 63 > ---. Since a is a lower bound for each set 
{xz,:k > n}, it follows that 6, > a for all n > 1. Lemma 3.4.4 thus implies that the 
sequence (G,,) converges. We now introduce notation which is used to identify the 
limit of the sequence (f,). 


Definition 3.8.1. Let (x,) be a bounded sequence. Let 8, = sup{r, : k > n} 
for each n > 1. The limit superior of (z,,), denoted by limsup zy, is defined by 
noo 


lim sup t, = lim Bp, that is, 
noo No 


lim sup <= jim (sup{ix cde 2) 


The sequential limit theorems covered in Section 3.2 may not hold for the limit 
superior operation. Compare the following theorem with Theorem 3.2.2. 


Theorem 3.8.2. Let (x,,) and (yp) be bounded sequences. Then 


lim sup(Zp, + Yn) < limsup x, + limsup yp. 
noo noo noo 


Proof. Let (ap) and (yn) be bounded. So (an + Yn) is bounded. For each n € N, let 
Cp = Spe HR eS hs OS Se PRS DY, ee a Se 


Let n EN. As rp < Bp and yx < Ym for all k > n, we see that rp + yp < By tn for all 
k > n. So Bn+n is an upper bound for the set {z, + y, : & > n}. Thus, on < Bn+n 
for all n > 1. Hence, lim On < lim Bat lim Yn by Theorems 3.2.13 and 3.2.2. Now, 


by Definition 3.8.1, we conclude that lim sup(1, + yn) < limsup z, + lim sup yp. 
N—- Oo N—- Oo Noo 
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There are bounded sequences where the inequality in Theorem 3.8.2 is strict (see 
Exercise 19). Thus, the properties that we established for the limit of a sequence (e.g., 
Theorem 3.2.2) do not necessarily hold for the limit superior of a bounded sequence. 
But the sequential limit theorems can be used to prove limit superior theorems. 


Theorem 3.8.3. If (x,) is a bounded sequence and (x,,,) is a convergent subsequence 


of (x»), then lim x, < limsup xy. 
100 noo 


Proof. Let (a) be bounded and let (x,,) be a convergent subsequence of (x,,). For 
each n > 1, let 6, = sup{x, : k > n}. So lim Bn = lim sup 2. Observe that 


N—- Oo 


Gy. S sup te kS i} — Gps forall 4S 1, 
Thus, mn, < Gn, for all 7 > 1. Therefore, 


lim ¢,, < lim $,, = lim 6, 
1 0o 1 0O noo 


by Theorems 3.2.13 and 3.3.4, respectively. Hence, lim rp, < limsup %p. 
100 n—-0o 


Let (x) be a bounded sequence. For each n > 1, let 8, = sup{x, : k > n}. Then 
for every n > 1 and € > 0, there exists a k > n such that 8, —¢ < xz (see Exercise 7 
on page 46). This observation will be applied in the following proof. 


Theorem 3.8.4. If (x,) is bounded, then (x,,) has a convergent subsequence (ap,) 


such that lim z,, = limsup xp. 
ioo n—0o 


Proof. Let (x,) be a bounded sequence and for each n > 1, let 6, = sup{xp, : k > n}. 
Thus, limsup vp, = lim Gry and (Bn) is a decreasing sequence. We will now define a 


subsequence (Xp,) of (tn), by induction on 7, such that 
1 
Pn; — ~ < %n,,; for alli > 1. (3.19) 
i 


(1) Since 6; = sup{a, : k > 1}, let ny > 1 be such that 8; — 1 < ry,. As Bn, < (i, 
it follows that 6,, —1< 2p,. 


(2) Let i > 1 and assume that the natural number n; has been defined. Since 
Bas Sap lag ek See 1}, 


let ni4, > nj +1 be such that B,,41 — a < Inj, Since N41 > m4 +1, we have 
that Bn... < Bni41- Hence, 


1 


4 < 
Briss aa 


Britd _ < Tnizi- 


+1 


Thus, Bn... — a < £n,;,,- This completes the inductive definition of (xp,). 
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Theorem 3.3.4 implies that lim 6,, = lim Bn. AS Bn; = Ln;, we see from (3.19) that 
100 noo 


1 
Gre Cine) = Bap — tne < +; for alls 21, 
i 
Lemma 3.2.11 implies that lim x,, = lim 6,,. Thus, jim Ln, = limsup Lp. LO 
1-00 1-00 n—0oo 


Corollary 3.8.5. Let (z,) be a convergent sequence. Then Jim, Ln = limsup Ly. 
N—-Oo 


Our next theorem will be used in Chapter 8 to derive results on power series (see 
Theorem 8.3.5). 
Theorem 3.8.6. Suppose that lim Zn > 0 where zy, # 0 for all n > 1. If (yn) isa 


bounded sequence, then limsup(%nyn) = lim 2, - limsup yn. 
noo MOO noo 


Proof. Suppose that lim Ln = € > 0 where zx, ¥ 0 for all n > 1. Let (y,) be a 


bounded sequence. Since (x,,) converges, Theorem 3.1.24 implies that (a,,) is bounded. 
Thus, (%nYn) is bounded. Let p = _ 1 sUP(TnYn) and y = limsup y,. We will show 
noo 


that p = fy. By Theorem 3.8.4 there i is a convergent subsequence (yn,) of (Yn) such 
that jim Yn, = y. Theorem 3.3.4 implies that jim Ln, = €. Thus, by Theorem 3.2.3, 


we conclude that Jim Ln, Un, = fy. Therefore, by < < p by Theorem 3.8.3. 
Since p = lim nS een there is a convergent subsequence (%n,;Yn;) Of (CrYn) Such 
oy 


that lim %,Yn,; = p, by Theorem 3.8.4. Since lim x,, = , Theorem 3.2.7 implies that 
1 0O ICO 


Thus, lim yn; = %. Hence, § < y by Theorem 3.8.3. So p < ¢y. Therefore, p = Cy. 
1 00 


Definition 3.8.7. Let (x,,) be a bounded sequence. A subsequential limit of (,,) 
is any real number that is the limit of some convergent subsequence of (x,). 


If (xn) is a bounded sequence of real numbers, then by Theorem 3.5.1 we know 
that (z,,) has a convergent subsequence. It is sometimes useful to know the maximum 
limit that such a subsequence can have. Our next theorem shows that the limit 
superior of (z,) is, in fact, this maximum subsequential limit (see Definition 2.3.5). 


Theorem 3.8.8. Let (x,) be a bounded sequence and also let S be the set of all 
subsequential limits of (7). Then limsupz, € S and tim aeuD 2, =sup(S). 

N—-0o 
Proof. Let (2) be bounded and let S' be the set of all subsequential limits of (x). 
Theorem 3.8.4 implies that limsup xz, € S. Also, limsup x, is an upper bound for S, 


Noo 


noo 
by Theorem 3.8.3. Hence, lim sup xp, = sup(S). 
noo 
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3.8.2 The Limit Inferior of a Bounded Sequence 


Let (x) be a bounded sequence. Thus, the set {x, : k > 1} is bounded. Let a be a 
lower bound and let 6 be an upper bound for the set {a, : k > 1}. So a and b are, 
respectively, a lower bound and an upper bound for any subset of {x, : k > 1}. For 
each n € N, consider the set {x, : k > n}. Clearly, we have that 


{ip ekh ol Ste ek 2) oO fap ke Sb Oe: Dee hemos (8.20) 


where A D> B means that B C A. As {xy : k > 1} is a bounded set, it follows from 
(3.20) that each of the sets {x, : k > n} are bounded. Moreover, (3.20) implies (see 
Exercise 4 on page 46) that 


int ee eS i pes be 2) ile fe 8} Seo Sint ee ee heey 


Thus, using the bounded sequence (,,), we can construct an increasing sequence (a) 
by defining 

Gy =i ae eS a} 
for all n > 1. Hence, ay < ag < ag <---. Since 6 is an upper bound for each set 
{z, : k > n}, it follows that an < 6 for all n > 1. Lemma 3.4.2 thus implies that 
the sequence (a,,) converges. The next definition introduces notation that is used to 
identify the limit of the sequence (a,). 


Definition 3.8.9. Let (x,) be a bounded sequence. Let a, = inf{z, : k > n} 
for each n > 1. The limit inferior of (xz,), denoted by lim. inf z,, is defined by 
lim inf 7, = lim Qn, that is, 


n> 


lim inf cs = jim (inf {xx :k>n}). 


The proofs of the following five results, from Theorem 3.8.10 to Theorem 3.8.14, 
can be adapted from the proofs of the corresponding theorems and corollary in the 
previous section. 


Theorem 3.8.10. Let (x,,) and (yn) be a bounded sequences. Then 
lim inf Ln + lim inf Yn < lim inf(a, + Yn). 
Theorem 3.8.11. Let (x,) be a bounded sequence and let (x,,) be a convergent 


subsequence of (,). Then liminf x, < lim 2p,. 
noo 100 


Theorem 3.8.12. Let (x,) be a bounded sequence. Then (x,,) has a convergent 
subsequence (Xp,) such that lim z,, = lim, Tat 4. 

woo n> Co 
Corollary 3.8.13. Let (x,) be a convergent sequence. Then lim Yn = liminf rp. 


n N—- Oo 


If (x) is a bounded sequence, then our next result shows that the limit inferior 
of (x»,) is equal to the minimum subsequential limit of (7) (see Definition 2.3.5). 


Theorem 3.8.14. Let (x,) be a bounded sequence and also let S' be the set of all 
subsequential limits of (7). Then lim. inf x, € S and lim, LL pT (9): 
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3.8.3 Connections and Relations 


We now focus on some relationships between the limit superior and limit inferior 
of a bounded sequence. We will also derive some useful tools, which will be applied 
in Chapter 7, for dealing with the convergence of infinite series. In particular, these 
tools will be used to derive a ratio test and a root test. 


Theorem 3.8.15. Let (x,,) be a bounded sequence. Then (x,,) converges if and only 


if lim, inf In = limsup Lp. 
noo 


Proof. Let (a) be bounded. Corollary 3.8.5 and Corollary 3.8.13 imply that if (x,) 


converges, then lim: inf Xn = limsup Z,. The converse also holds (see Exercise 21). 
N—-0o 


Theorem 3.8.16. Let (x,,) be a bounded sequences. Then 


1. lim, inf Ln < limsup Ly. 
noo 


2. If limsupz, < b, then there is an N € N such that x, < 6 for alln > N. 


N—-0o 


3. If lim inf In > a, then there is an N € N such that z, > a for alln > N. 


Proof. See Exercises 3, 4, and 5. 


Theorem 3.8.17. Let (x,,) be a sequence of positive terms where (2 


Then ((am)*) is bounded and 


- ) is bounded. 


2 op Ent] ae alas 1 . 1 ‘ Unt 
lim inf “** < lim inf (x,)= <limsup (a,)* <limsup 2. 
noo Ln n—-Cco noo N+ Co In 


(3.21) 


Proof. Suppose that (x,,) is a sequence whose terms are all positive and that (nit ) 


is bounded. By Exercise 16, ((an)*) is bounded. The “middle” inequality in (3.21) 


holds by Theorem 3.8.16(1). To prove that lim sup (xp) * < ina n sup “nit suppose that 
noo 


lim sup + < lim sup (2n)r. Thus, there is a b > 0 (see ee 1) § such that 
N—- Oo 


N—- Oo 


lim sup ntl <b < lim sup (2n)*. (3.22) 


n—0o qn n—0o 


Since lim sup =* < 6, there exists an N € N such that “+ < b for all n > N, by 


Noo 


Theorem 38.16(2 ). Thus, by Exercise 16, limsup(xp)* <b which contradicts (3.22). 
N—- Oo 


A similar argument, using Exercise 18, shows that lim inf + < lim inf (xp). 
noo «tn n—-0o 


Corollary 3.8.18. Let (x,,) be a sequence of positive terms. 


(1) If (% 


(2) If (uit ) diverges to oo, then im, (t,)" = = lim, 2 ntl — Oo 


S00 En 


1) converges to y € R, then Jim, (7) = 7. 


En+1 
n 


Proof. Corollaries 3.8.5, 3.8.13, and Theorems 3.8.17, 3.8.15 imply (1). Exercise 10 
on page 92 establishes (2). 
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Exercises 3.8 


1. 
2. 


17. 


*18. 


Find an example to show that equality in Theorem 3.8.2 may not hold. 


Let (r,) be a bounded sequence and let m € N. Hence, (%m4n) is also a 


bounded sequence (see Definition 3.3.10). Show that limsup x, = limsup %m+n 
N—- Oo N—- Oo 


and lim inf v7, = liminf t4n. 
noo noo 


. Prove Theorem 3.8.16(1). 
. Prove Theorem 3.8.16(2). 
. Prove Theorem 3.8.16(3). 
. Let (%,) and (yp) be bounded sequences. Suppose that x, < Yp for all n € N. 


Prove that limsupz, < limsup y, and lim inf x, < liminf yp. 
noo noo n—?00 noo 


. Let (x,) be a bounded sequence. Let N, m, and M be such that m < x, < M 


for alln > N. Prove that m < lim inf Ln <limsupzy, < M. 
n [oe) 


N—- Oo 


. Prove that lim sup(c + @,) = c+ limsupz,, when (x,) is bounded and c € R. 


noo N—-0o 


. Prove that lim inf(¢ + In) =ct+ lim inf Xn, When (x7) is bounded and c € R. 


. Prove that lim sup(cz,) = climsup x,, when (x,,) is bounded and c > 0. 


noo N—- Oo 


. Prove that lim inf (cap ) =a clim inf Xn, When (x,,) is bounded and c > 0. 


. Prove that lim sup(czr;,) = clim inf Xn, When (x,,) is bounded and c < 0. 


noo 


. Prove that lim inf (cap ) = climsup %,, when (x) is bounded and c < 0. 


N—- Oo 


. Prove that lim sup(c*z,) = limsup x, when (xp) is bounded and c > 0. 


noo N—- Oo 


. Prove Theorem 3.8.10. 
*16. 


Let (tn) be a sequence of positive terms. Let b > 0 and N € N be such that 
anti < 6 for alln > N. Let c= ay > 0. 


(a) Prove, by induction, that ry, < cb”, for all n > 1. 
(b) Infer that (cvan)* < (c)nb for all n > 1, and that ((an4n)*) is bounded. 


(c) Prove that limsup (xp) < b. 
N—- Oo 


Let (#,) be bounded with positive terms. Show that ((an)* ) is bounded. 


Let (xt) be a sequence of positive terms where (zx,) and (2211 ) are bounded. 
Let a>0 and N EN be such that a < for alln > N. Let c= ary > 0. 


In 


(a) Prove, by induction, that ca” < xy 4n, for all n > 1. 
(b) Infer that (c)na < (yan) for all n> 1. 


a 
n 


(c) Prove that a < lim inf (ig) 


*19 


20 
*21 


22. 
23. 
24. 
25. 
26. 


27. 
28. 


29. 
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. Consider the bounded sequences (%,) = ((—1)”) and (yn) = ((—1)"""). Compare 
the values of 


lim sup(%p + yn) and limsup x, + lim sup yn, 
noo n—0o n—0o 


and compare the two values 


lim sup(Zp + Yn) and limsup x, - limsup Yp. 
noo noo noo 


. Prove Corollary 3.8.5. 


. Let (x,) be a bounded sequence. For each n € N, let a, = inf{x, : k > n} and 
Ba Ssuplag tho a 
(a) Show that ay <p < By for all n > 1. 


(b) Show that liminf x, < lim sup xp. 
NCO noo 


(c) Suppose that lim inf z, = @ = limsupz,. Show that lim x, = @. 
n-co n—-0oo noo 


Prove Theorem 3.8.10 by appropriately modifying the proof of Theorem 3.8.2. 
Prove Theorem 3.8.11 by appropriately modifying the proof of Theorem 3.8.3. 
Prove Theorem 3.8.12 by appropriately modifying the proof of Theorem 3.8.4. 
Prove Theorem 3.8.14 by appropriately modifying the proof of Theorem 3.8.8. 
Let (a) and (yp) be a bounded sequences with positive terms. 
(a) Prove that lim sup(2, - Yn) < limsup x, - limsup yp. 

n—¥00 n—¥00 noo 
(b) Find an example to show that equality may not hold in part (a). 
Show that there no sequence whose set of subsequential limits is { 2 € N}. 
Suppose that Jim, Ln = > 0 where x, #0 for all n > 1. Let (y,) be a bounded 


n 
n+1 
sequence. Prove that lim: inf(2nYn) =f lim. inf Yn- 

Suppose that lim In = €. Let (yn) be a bounded sequence. Prove that 
n—?Co 


lim sup(%p + Yn) = + limsup y, and liminf(r, + y,) = ¢ + liminf yp. 
Noo n—-0oo n—+>0o noo 


Exercise Notes: For Exercises 4, let o = lim Gn where 6, = sup{z,: k > n}. Let 
€ =b—o > 0. For Exercises 10-18, recall Theorem 2.3.13. For Exercises 14, recall 
Corollary 3.2.9. For Exercises 28 and 29, review the proof of Theorem 3.8.6. 
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CHAPTER 4 


Continuity 


Virtually all the functions discussed in a calculus course are continuous. In such a 
course it is often said that a function is continuous if its graph is an “unbroken” 
curve, that is, one can pencil-sketch the graph without raising the pencil off the 
page. Of course, the concept of an unbroken curve is intuitive and easily understood, 
but this description is not mathematically precise. In the following section we shall 
present a mathematical rigorous definition of the continuity concept. We also show 
that the algebraic combination and the composition of continuous functions produces 
a continuous function. In the remaining sections of this chapter, we will show how 
continuity is connected to sequences and limits. Moreover, continuous functions have 
many nice properties that will be identified in theorems that will appear in this and 
the remaining chapters of the book. 


4.1 CONTINUOUS FUNCTIONS 


Let D C R. What does it mean to say that a function f: D — R is continuous? 
Intuitively, a continuous function is one for which a small change in the input results 
in a small change in the output. Here is the precise definition.! 


Definition 4.1.1. Let f: D — R and let c be in D. Then f is continuous at c 
when the following holds: 


For every ¢ > 0 there exists a 6 > 0 such that for all x € D, if |x —c| < 6, 


then | f(x) — f(c)| <e. 
We say that f: D— R is continuous when f is continuous at every point c € D. 


The phrase “if |2 — c| < 6, then | f(x) — f(c)| < e” in Definition 4.1.1 is illustrated 
in Figure 4.1. The logical form of Definition 4.1.1 can be expressed as 


(Ve > 0)(4d > 0)(Va € D) (|x — 


<0 =7|f(2) = fle) <8) (4.1) 


and it is this logical form that motivates the next proof strategy. 


‘We will always presume that D C R. 
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Figure 4.1: Illustration for Definition 4.1.1. 


Proof Strategy 4.1.2. Let f: D— R and let c € D. To prove that f is continuous 
at c, use the proof diagram: 


Let ¢ > 0 be an arbitrary real number. 
Let 6 = (the positive value you found). 
Let x € D be arbitrary. 
Assume |x — c| < 6. 


Prove |f(x) — f(c)| <e. 


To apply proof strategy 4.1.2 to a function f that is defined by an explicit formula, 
first let ¢ > 0. Then we must find a 6 > 0 so that when x € D and |x —c| < 4, we 
can prove that | f(x) — f(c)| < e. To find the desired 6, we use algebra and properties 
of inequality on the expression |f(x) — f(c)| to “extract out” a larger value that 
contains |2 — c| and no other occurrences of «. This process may involve finding an 
appropriate upper bounds for “unwanted” factors that involve x, but are different 
than |x — c|. Upon completing this process, we should then be able to find 6 so that 
when |x — c| < 6, we will have | f(x) — f(c)| < ¢. We illustrate this technique in our 
proof analysis of the next theorem. 


Theorem. Define f: R > R by f(x) = 327 + 5. Then f is continuous at c € R. 


Proof Analysis. Let c € R and let ¢ > 0. Using algebra and properties of inequality 
on the expression | f(a) — f(c)|, we extract out |x — c| as follows: 


f(x) — fle)| = |3a? + 5 — (3c? + 5) by definition of f 


= |3(¢? — )| by algebra 
= 3|z+¢||x-—c by algebra and property of | |. 
In this case, we end up with the unwanted factor |2 + c|, which involves 7. We need 
to find an upper bound for this factor. We are free to make |x — c| as small as is 
needed to obtain an upper bound for |z + c|. Let us start with |x + c| and extract 
out a larger value that involves |x — c| as follows: 
je+cl=|x—c+2c| _ by subtracting and adding c 
<|x—cl+2|c| by triangle inequality. 


So if |x — c| < 1, we have that |x + c| < 142 |c| and | f(x) — f(c)| < 3(14+2|e|) |x — el. 
Solving the inequality 3(1+ 2 |c|) | — c| < e for |x — c|, we obtain |x — c| < 
Thus, if |e —c| < 1 and |r —c¢| < 
So, we will let 6 = min{1 


34 3a)° 
34g) We can conclude that f(x) — f(e)| <e. 


. sates We can now present a logically correct proof. 
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Proof. Let f: R — R be the function defined by f(x) = 327 + 5. Let ¢ € R and 
é > 0. Let 6 = min{1, 357} where M =1+2|c|. Let x € R satisfy |x — c| < 6. Since 
jc —c| <6 <1, it follows that 


jv + cl = |x —c+ 2c by subtracting and adding c 
<|x—cl+2|c| _ by triangle inequality 
<1+2|c|=M because |x —c| <6 <1. 


Thus, (A) |x + c| < M. We now prove | f(x) — f(c)| < € as follows: 


IF(2) — f(0)| = [80 + 5 — (3c? + 5)] by definition of f 


= 32” _ é)| by algebra 

=3|r+e||z—c¢| by algebra and property of | |. 
<3M |x —¢| by (a) 

<3Mx =e because |e — el <8 < so. 


Therefore, | f(x) — f(c)| < ¢ and f is continuous at c. 


There may times when one is required to assume or show that a function is not 
continuous at a point. By evaluating the negation of (4.1), our next remark clarifies 
the meaning of discontinuity at a point. 


Remark 4.1.3. Let f: D— R and let c € D. Then f is not continuous at c if and 
only if there exists an ¢ > 0 such that for all 6 > 0, there is an x € D such that 
|x —c| < 6 and |f(x) — f(c)| >. 


Combinations of Continuous Functions 


Recall that a real-valued function is one that has the form f: D— R where DCR. 
Given two real-valued functions with the same domain, one can use the standard 
algebraic operations on the real numbers to define a new real-valued function. 


Definition 4.1.4. Let f: D— Rand g: D > R. We define new functions as follows: 


(1) The sum (f + g): D > R is defined by (f + g)(x) = f(x) + g(a) for all x € D. 

(2) The difference (f — g): D — R is defined by (f — g)(x) = f(x) — g(x) for all 
ee dD, 

(3) The product (fg): D — R is defined by (fg)(x) = f(x)g(x) for all x € D. 

(4) For k € R, the constant multiple (kf): D — R is defined by (kf)(x) = kf (x) 
for all x € D. 

(5) Let D* = {x € D: g(x) £ 0}. Then the quotient (4): D* —> R is defined by 


g 
(4) (a) = oe for all « € D*. 


We will show that if f and g are continuous at a point c, then all of the operations 
defined in Definition 4.1.4 will produce a function that is also continuous at c. In other 
words, the above operations (1)—(5) preserve continuity. 
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Preservation-of-Continuity Theorems 
Consider a theorem having the following generic form. 


Theorem. Let f: D— Rand g: D > R be functions and let c € D. Suppose that 
f is continuous at c and g is continuous at c. Then a new function h: D > R, which 
is defined from the given functions, is also continuous at c. 


How does one prove theorems that have this form? We are assuming that f and 
g are continuous at c, and we must prove that h is continuous at c. 


Proof Strategy 4.1.5. To prove that h is continuous at c, use the proof diagram: 


Assume f and g are continuous at. c. 
Let ¢ > 0 be an arbitrary real number. 
Let 6 = (the positive value you found). 
Let x € D be arbitrary. 
Assume |x — c| < 6. 


Prove |h(x) — h(c)| <e. 


To apply proof strategy 4.1.5, let ¢ > 0. We must then find a 6 > 0 such that 
|h(a) — h(c)| < ¢ when |x — c| < 6. Since f and g are continuous at c, we can make 
| f(x) — f(c)| and |g(x) — g(c)| “as small as we want.” Here is the basic idea that we 
will apply to get 6: 

Using algebra and properties of inequality on the expression |h(x) — h(c)|, 
we extract out a larger value that contains | f(x) — f(c)| and |g(x) — g(c)|, 
and no other occurrences of x, f(x), or g(x). 


Since we can make | f(x) — f(c)| and |g(x) — g(c)| “as small as we want,” we should 
be able to find 6. These ideas appear in the proof analysis of our next two theorems. 


Remark. If f: D — R is continuous at c, then for any value e > 0, there is a d > 0 
such that | f(x) — f(c)| < e whenever |x —c| < 6 and x € D. This is the meaning 
behind the phrase “we can make | f(«) — f(c)| as small as we want.” 


Theorem 4.1.6. Let f: D — Rand g: D—- R be functions, and let c € D. Suppose 
that f and g are both continuous at c. Then (f + g) is continuous at c. 


Proof Analysis. Let f and g be continuous at c. So we can make | f(a) — f(c)| and 
|g(x) — g(c)| as small as we want. Let ¢ > 0. To ensure that 


(f+ (x) —(F + gO] <€ (4.2) 


when z is sufficiently closet to c, we first extract out | f(x) — f(c)| and |g(x) — g(c)| 
from the expression |(f + g)(x) — (f + g)(c)| as follows: 


(Ff +(e) —(F + MO = IF) + 9(2)) — (Fl) + 9 
= |(F(@) — Fle) + (g(a) — g 
< |f(«) — f(c)| + |e(x) — g(c)| by triangle inequality. 

So if |f(x) — f(e)| < 5 and |g(x) — g(c)| < §, then we can conclude (4.2). This 
analysis and proof strategy 4.1.5, guides the following logically correct proof. 


c))| by definition of f +g 
c))| by algebra 


—_ 
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Proof. Let f: D— Rand g: D> R be continuous at c € D. To prove that f + g is 
continuous at c, let ¢ > 0. Since f is continuous at c, there is a 6, > 0 such that for 
alla € D, 

f(x) — f(e)| < : when |x — cl < 6. (4.3) 


Also, because g is continuous at c, there is a d2 > 0 such that for all x € D, 
Ig(x) — g(c)| < . when |x — cl < 9. (4.4) 


Let 6 = min{61, 62}. Let x € D and assume |x —c| < 6. Thus, |x —c| < 6; and 
|x — c| < 69. We now prove that |(f + g)(x) — (f + g)(©)| < € as follows: 


(f+ g)(z) —(f + 9)(C)| = 


x)) — (f(e) + g(e))| by definition of f +9 
fe) H0)+ (g(x) — g( 
| 


+ |g(x) — g(c)| by triangle inequality 


c))| by algebra 


=€E by (4.3) and (4.4). 


Therefore, (f +g) is continuous at c. 


In the proof of Theorem 4.1.6, we were able to “cleanly” extract out | f(a) — f(c) 
and |g(x) — g(c)|, that is, there were no additional factors. In other situations there 
may be “unwanted” factors involving x, f(x), or g(x). You will then have to find an 
appropriate upper bound for these factors. This will have to be done in the proofs of 
Theorems 4.1.8 and 4.1.10. In our proof of Theorem 4.1.8, we obtain the unwanted 
factor rob Our next lemma shows that if g is continuous at c and g(c) 4 0, then 
there is a neighborhood of c and an m > 0 such that m < |g(x)| for any x in this 
neighborhood. Thus, we will also have that aay < +. So = will be our desired upper 
bound for the proof of Theorem 4.1.8. 


Lemma 4.1.7. Let g: D > R be continuous at c € D where g(c) 4 0. Then there 
exists an m > 0 and a 6 > 0 such that for all x € D, if |x — c| < 6, then m < |g(z)]. 


Proof. Let g: D — R be continuous at c € D where g(c) 4 0. Since g(c) 4 0, consider 
the positive value I9(o)| 
\g(x) — g(c)| < I9(o)| if ¢ € Dand |x —c| < 6. Let x € D satisfy |x — c| < 6. Thus, 
lg(x) — g(c)| < sl So by the backward triangle inequality, |g(c)| — |g(x)| < sl 


and [g(c)| - 9! < |g(x)|. Thus, I9(o)| < |g(x)|. Let m= Ia > 0. Hence, m < |g(x)| 
when x € D and |x —c| < 6. 


. Since g is continuous at c, there exists a 6 > 0 such that 


Let g: D > R be a function and let D* = {x € D: g(x) # 0}. We define the 
reciprocal function by (4) (2) a re for all x € D*. If g is continuous at c € D 


where g(c) £ 0, then Lemma 4.1.7 will be used to prove that (4) is continuous at c. 


Theorem 4.1.8. Let g: D > R be continuous at c € D where g(c) 4 0. Then (4) 
is continuous at c. 
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Proof Analysis. Let g be continuous at c € D where g(c) # 0. So we can make 


|g(x) — g(c)| as small as we want. Let ¢ > 0. We need to make Fe < €¢ when 


g(x) =a) 
x is close enough to c. Since g is continuous at c, Lemma 4.1.7 implies that there is 


a 6; >0 and an m > 0 such that (x) m < |g(x)| when |x — c| < 6; and x € D. Thus, 
m< Ig(e )| a |c—c| =0 < 6; and cé€ D. Note that when m < |g(x)|, we have 


that 7 ia < 4 So assume that |x — c| < 6). We can now extract out |g(x) — g(c)| 
from the ee sh — ae as follows: 
| ! : = ae = he) by algebra 
g(a) gle) g(x) g(c) 
-_ lg(z) — a(0)| property of absolute value 
lg(x)| la(e)| 
lg(x) — 9] 
< 5 by (x). 


ae < ¢. We can now 
present a logically correct proof which is guided by proof strategy 4.1.5. 


Thus, if |g(a) — g(c)| < me, we can conclude that lata - 


Proof. Let g: D > R be continuous at c € D where g(c) 4 0. Lemma 4.1.7 implies 
that there is a 6; > 0 and an m > 0 such that 


m < |g(x)| when |x — c| < 6; and x € D. (4.5) 
Let ¢ > 0. Since g is continuous at c, there is a 62 such that 
|g(x) — g(c)| < em? when |x — c| < dy and x € D. (4.6) 


Let 6 = min{61, 62}. Let  € D be such that |x —c| < 6. Thus, |x —c| < 6; and 


|x — c| < 6g, and so (4.5) and (4.6) apply. We prove that = < € as follows: 


CD) a0) 
1 1 |_ |g 92) 
Fo ro -| gage) | Be" 
= ane 4 property of absolute value 
MIA Il ay ay 
a =é by (4.6.) 


Therefore, (4) is continuous at c. 


We next show that if a function is continuous at a point, then it is bounded near 
this point. This will allow us to prove that the product and quotient of two functions 
is continuous at a point, if each of the two functions is continuous at the point. 


Lemma 4.1.9. If f: D — R is continuous at c € D, then there is a 6 > 0 and an 
M > 0 such that |f(a)| < M for all x € D satisfying |x — c| < 4. 
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Proof. Let f: D — R be continuous at c € D. Let ¢ = 1. Since f is continuous at c, 
there exists a 6 > 0 such that | f(x) — f(c)| < 1 whenever x € D and |x —c¢| < 6. 
Let x € D satisfy |x —c| < 6. Thus, | f(x) — f(c)| < 1. By the backward triangle 
inequality, we obtain | f(x)| — |f(c)| < 1. Hence, |f(x)| < 1+ |f(c)| whenever x € D 
and |x — c| < 6. Thus, for M =1+|f(c)| > 0 we have that | f(x)| < M for allze D 
satisfying |x — c| < 6. 


Theorem 4.1.10. Let f: D— Rand g: D > R be continuous at c € D. Then fg 
is continuous at c. 


Proof. See Exercise 15. 


Theorem 4.1.11. Let f: D + Rand g: D > R be continuous at c € D. If g(c) £ 0, 


then (5) is continuous at c. 


Proof. Let f: D — Rand g: D > R be continuous at c € D where g(c) 4 0. Thus, 


(4) is continuous at c, by Theorem 4.1.8. Since (£) =f. (), Theorem 4.1.10 
implies that (4) is continuous at c. 


Theorems 4.1.6, 4.1.10, and 4.1.11 ensure that any function that is an algebraic 
combination of continuous functions is a continuous function. Thus, we have the 
following result. 


Theorem 4.1.12. A polynomial function is continuous at every point c € R, anda 
rational function is continuous at every point c for which the function is defined. 


Proof. One can show that every constant functions is continuous at each point and 
the function f(x) = zx is also continuous at each point. These two facts, together with 
Theorem 4.1.6, Theorem 4.1.10 and mathematical induction, show that a polynomial 
function is continuous at every real number. Theorem 4.1.11 now implies that rational 
functions are continuous at the points stated in the theorem. 


We now identify conditions which ensure that a composite function is continuous 
at a point. 


Theorem 4.1.13. Let f: D— Rand g: E > R be functions such that f[D] C E. 
If f is continuous at c € D and g is continuous at f(c), then (go f): D > R is 
continuous at c. 


Proof. Let f: D— Rand g: E > R be as stated. Let ¢ > 0. Since g is continuous 
at f(c), there is a 6, > 0 such that 


lg(y) — g(f(c))| < € for all y € EF satisfying |y — f(c)| < 61. (4.7) 


Since f is continuous at c, there is a 6 > 0 such that | f(x) — f(c)| < 61 for all x € D 
satisfying |x — c| < 6. Now, let x € D be such that |x — c| < 6. Then f(x) € E and 
| f(x) — f(c)| < 41. Since | f(x) — f(c)| < 61, we see that (4.7) implies 
(go A(z) — (9° ACO) = lg F(@)) — g(F (©) by definition of g o f 
<r by (4.7). 


Therefore, g o f is continuous at c. 
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Our next lemma shows that if a function is continuous and positive at a point c, 
then the function is positive at all of the points in a particular neighborhood of c. 


Lemma 4.1.14. Let f: D— R be continuous at c € D where f(c) 4 0. 


(a) If f(c) > 0, there is a 6 > 0 where f(x) > 0 for all x € D satisfying |x — c| < 0. 
(b) If f(c) < 0, there is a 6 > 0 where f(x) < 0 for all x € D satisfying |x — c| < 6. 


Proof. Let f: D — R be continuous at c € D where f(c) £ 0. Let ¢ = |f(c)| > 0. 
Since f is continuous at c, there exists a 6 > 0 such that | f(x) — f(c)| < ¢ whenever 
x € D and |x —c| < 6. We now prove (a). Assume f(c) > 0. Thus, ¢ = f(c). So we 
have that | f(a) — f(c)| < « = f(c) whenever z € D and |x — c| < 6. So, if x € D and 
|x — c| < 6, then 

fl) - f(a) <|f@ - Ol < FO, 


and thus, f(c) — f(x) < f(c). We conclude that 0 < f(x) whenever x € D and 
|x — c| < 6. Thus, (a) holds. The proof of (b), using ¢« = —f(c), is very similar. 


Many important theorems in real analysis assume that a function is continuous 
at every point in its domain. Our next strategy shows how to prove that a function 
is continuous. 


Proof Strategy 4.1.15. Let f: D — R. To prove that f is continuous, use the 
proof diagram: 
Let c € D be an arbitrary. 
Prove f is continuous at c. 


Exercises 4.1 


1. Let m and d be real numbers where m ¥ 0. Consider the function f: R > R 
defined by f(x) = mx +d. Using Proof Strategies 4.1.15 and 4.1.2, prove that f 
is continuous. 


2. Let g: [—1,1] — R be defined by 


Prove that g is not continuous at 0. 


3. Find functions f: [—1,1] > R and g: [—1,1] — R that are not continuous at 0 
but f +g and fg are continuous at 0. 


*4. Define f: R > R by f(x) = |z| for all ze € R. Using Proof Strategies 4.1.15 
and 4.1.2, prove that f is continuous. 


*5. Let f: R — R be defined by f(x) = 2? for all x € R. Using Proof Strategies 4.1.15 
and 4.1.2, prove that f is continuous. 


6. Let f: [0,00) — R be defined by f(x) = /@ for all x € [0,c0). Using Proof 
Strategies 4.1.15 and 4.1.2, prove that f is continuous. 


10. 


11. 


12. 


13. 


14. 


“15. 
16. 
17. 


18. 
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. Let f: [1,3] — R be defined by f(x) = 5 for all x € [1,3]. Using Proof 


Strategies 4.1.15 and 4.1.2, prove that f is continuous. 


. Let f: D> Rand g: D—> R be functions and let c € D. Suppose that f and 


g are both continuous at c. Prove that there exists an M > 0 and ao > 0 such 
that |f(x)| < M and |g(x)| < M when |x —c¢| <6. 


. Let f: D— Rand g: D> R be functions and let c € D. Suppose that f and g 


are both continuous at c. Prove that f — g is continuous at c. 


Let f: D > R be a function and let c € D. Suppose that f is continuous at c. 
Define g: D > R by g(x) = |f(x)|. Prove that g is continuous at c. 


Let f: D > R be a function and let c € D. Suppose that f is continuous at c 
and that k € R is nonzero. Prove that kf is continuous at c. 


Let f: D —> R be continuous at c € D. Let D* C D be such that c € D*. Define 
h: D* +R by h(x) = f(x). Show that A is continuous at c. 


Let f: D > R* be a function. Suppose that f is continuous at c € D. Define 
Jf: D> Rt by JV f(x) = Vf(z). Prove that /f is continuous at c. 

Let f: D— Rand g: D> R be functions and let c € D. Suppose that 

(a) g is continuous at c, and 

(b) [f() — FQ S [g(@)| |@ — ¢| for all x € D. 

Prove that f is continuous at c. 

Prove Theorem 4.1.10. 

Prove Lemma 4.1.14(b). 


Let f: D— R be continuous at c € D and let h € R. Prove the following: 
(a) If f(c) > h, there is a 6 > 0 such that f(x) > A for all x € D satisfying 


lx —cl <0. 
(b) If f(c) < h, there is a 6 > 0 such that f(x) < h for all x € D satisfying 
lx —cl <0. 


Let f: D— Rand g: D> R be continuous at c € D. 
(a) Let h: D > R be defined by h(x) = max{f(zx),g(x)}. Prove that h is 


continuous at c. 
(b) Let k: D > R be defined by k(x) = min{ f(z), g(x)}. Prove that k is con- 


tinuous at c. 


Exercise Notes: For Exercises 8 and 14, use Lemma 4.1.9. For Exercise 15, to prove 
fg is continuous at c, note that 


f(z) 9(@) — FO9O| = |F@)9(@) — Fla)al) + Fla)a() — FE g(0)] - 


After “extracting out” |g(x) — g(c)| and | f(a) — f(c)|, use Exercise 8. For the proof of 
Exercise 17, the function f(x)—h is continuous. Apply Lemma 4.1.14. For Exercise 18, 
apply Exercise 16 on page 35, Theorem 4.1.6, and the above Exercises 9, 10, and 11. 
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4.2 CONTINUITY AND SEQUENCES 


Our next theorem shows that there is another way, using sequences, to show that a 
function is continuous at a point. This result establishes an important relationship 
between continuity and convergent sequences. As the following proof uses the negation 
of continuity, one should review Remark 4.1.3. 


Theorem 4.2.1 (Sequential Criterion for Continuity). Let f: D — R and let c € D. 
Then the following are equivalent: 

(1) f is continuous at c. 

(2) For every sequence (x,,) of points in D, if lim In = c, then lim tite 7). 


Proof. Let f: D — R and let c € D. We shall prove that f is continuous at c if and 
only if for every sequence (z,) of points in D, if lim Ln = c, then lim ta) HTC). 


(=). Assume that f is continuous at c. Let (x,) be an arbitrary sequence of points 

in D such that lim x, = c. We will prove that lim f(r,) = f(c). Let « > 0. Since 
noo noo 

f is continuous at c, there is a 6 > 0 such that 


for all x € D, if |x — ¢| < 6, then | f(x) — f(c)| <e. (4.8) 


Now, because (x,) converges to c, there is an N € N such that for all n > N, 
|%n —c| < 6. Thus, (4.8) implies that |f(z,) — f(c)| < € for all n > N. Hence, 


(<=). Assume that 


for any sequence (x,,) of points in D, if lim In = c, then lim F(t,) = Fle). (4.9) 


We prove that f is continuous at c. Suppose, for a contradiction, that f is not 
continuous at c. Thus, there is an ¢ > 0 such that 


for all 6 > 0, there is an x € D such that |x — c| < 6 and |f(x) — f(c)| > «. 


Thus, for each n € N (letting 6 = +), there is an z, € D so that |x, —c| < + and 
|f(an) — f(e)| => ¢. Therefore, using this ¢, we obtain a sequence (z,,) such that 
(i) tn € D, (ii) |a, — ¢| < 4, and (iii) |f(xn) — f(o)| > € for all n € N. It follows, 


using (i) and (ii) (see Theorem 3.1.11), that 
(Zn) is a sequence of points in D and lim i = 6 (4.10) 


However, (4.9) and (4.10) imply that lim f(@n) = f(c). So there is an n such that 
| f (an) — f(c)| < €; however, this contradicts (iii). This completes the proof. 


In Section 4.1, we used Definition 4.1.1 (the definition of continuity) to prove 
theorems about continuity. Using previously established results on sequences, we can 
produce new proofs of these same theorems, using Theorem 4.2.1. To illustrate this, 
we now restate Theorem 4.1.6 and give it a new “sequential” proof. 
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Theorem 4.1.6. Let f: D — Rand g: D—- R be functions, and let c € D. Suppose 
that f and g are both continuous at c. Then (f + g) is continuous at c. 


Proof. Let f: D— Rand g: D > R be continuous at c € D. Let (x,) be an arbitrary 
sequence of points in D such that lim Ln = c. Since f and g are continuous at c, 
n [o.e) 


Theorem 4.2.1 implies that 


lim f(v,) = f(c) and lim g(a») = 9(0) (4.11) 


noo 


We show that lim (f + 9)(%n) = (f + 9)(©) as follows: 


jim (f + 9)(@n) = lim [f(tn) + 9(@n)] by Definition 4.1.4(1) 
= Jim. f(an) + jim, g(Zn) by Theorem 3.2.2 
= f(c)+ g(c) by (4.11) 
= (f +g)(c) by Definition 4.1.4(1). 


Thus, lim (f+g9)(an) = (f+9)(c). So (f+ g) is continuous at c by Theorem 4.2.1. 


Theorem 4.2.1 can now be used to prove that a function is continuous at a point. 
It can also be used to show that a function is not continuous at a point. 


Corollary 4.2.2. Let f: D— Rand c€ D. If there is a sequence (x,,) of points in 
D such that lim Go 6 aod lim f(tn) A f(c), then f is not continuous at c. 


Exercises 4.2 


1. Define the function f: R > R by 


_ fi, ste |e Q; 
f= ace 


Prove that f is not continuous at any point c € R. 


2. Define the function f: R — R by 


(eda e O; 
fle) =|? if2éQ. 


Prove that f is continuous at 0. 


3. Let f: R — R be continuous and c € R. Suppose that f(z) = c for allz € Q. 
Prove that f(x) = c for all z ER. 


4. Let f: (0,1) > R be the continuous function defined by f(z) = 4. Give an 
example of a convergent sequence (%,,) where xz, € (0,1) for all n > 1, so that 


(f(an)) diverges. 
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5. Let f: D> R and g: D > R be continuous at c € D. Use Theorem 4.2.1 and 
Theorem 3.2.3 to prove that (fg): D — R is continuous at c. 


6. Let f: D— Rand g: E > R be functions such that f[|D] C E. Suppose that f 
is continuous at c € D and g is continuous at f(c). Use Theorem 4.2.1 to prove 
that (go f): D — R is continuous at c. 


7. Let f: D > E be one-to-one and continuous where F C R. Prove that if c € D 
is an accumulation point of D, then f(c) is an accumulation point of E. 


8. Let f: D > E be a bijection such that f~! is continuous, where E C R. Let 
(Zn) be a sequence of points in D and let c € D. Suppose that (f(x,,)) converges 
to f(c). Show that (z,) converges to c. 
Exercise Notes: For Exercises 1-3: Both Q and the set of irrational numbers R \ Q 
are dense in R. Apply Lemma 3.1.18 and Theorem 4.2.1. 


4.3 LIMITS OF FUNCTIONS 


In analysis, the limit of a function is a fundamental concept concerning the behavior 
of a function near a particular point c. Informally, a function has a limit LD at the 
point cif f(x) is “very close” to L whenever x is “very close” to c. Before we present 
the formal definition, we recall Definition 3.5.2 of Chapter 3. 


Definition. Let S C R. A point x € R is said to be an accumulation point of S if 
every neighborhood of xz contains an infinite number of points from S. 


Definition 4.3.1. Let f: D — R, LE € R, and c be an accumulation point of D. 
Then L is the limit of f at c, denoted by lim f(x) = L, if for every « > 0 there 
exists a 6 > 0 such that for all x € D, if 0 < |x —c| < 6, then |f(x) — L| <«. 


The definition of the limit at a point c is very similar to the definition of continuity 
at c, however, there are three key differences. First, the definition of the limit requires 
c to be an accumulation point. This is not required in the definition of continuity. 
Second, in the limit definition, the inequality 0 < |” —c| < 6 is used instead of the 
inequality |x — c| < 6. Thus, in Definition 4.3.1, the point c is not assumed to be in 
the domain of the function, whereas in the definition of continuity, c must be in the 
domain. Finally, in the above definition, the limit value L is used rather than f(c). 
The term f(c) is crucial for the definition of continuity, but it is not relevant in the 
above limit definition. 

Definition 4.3.1 can be used in a proof by means of the following proof strategy. 


Proof Strategy 4.3.2. Let f: D — R and let c be an accumulation point of D. To 
prove that lim f(x) = L, use the proof diagram: 


Let ¢ > 0 be an arbitrary real number. 
Let 6 = (the positive value you found). 
Let x € D be an arbitrary. 
Assume 0 < |x —c| < 6. 
Prove |f(x) —L| <e. 
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To apply proof strategy 4.3.2 to a function f, defined by a formula, first let 
é > 0. Then find a 6 > 0 such that when 0 < |x —c| < 4, one can prove that 
| f(x) — L| < ©. To find the desired 6, use algebra and properties of inequality on 
the expression | f(x) — L| to “extract out” a larger value that contains |x — c| and no 
other occurrences of x. 


Theorem. Let f: R — R be defined by f(x) = x? + 2x + 4. Then lim, play. 
xr 


Proof Analysis. Let ¢ > 0. From the expression | f(x) — 19| we need to “extract out” 
|x — 3| and no other occurrences of x. Observe that 


|f() — 19| = 2? + 20 - 15] = |x +5] |x — 3] 


where we have |x — 3| and the term |x + 5| which contains x. Let us assume that 
|x — 3| <1. Then 


jo + 5| = la -—34+3+45] < |x —3)/4+8<14+8=9. 


Thus, | f(z) — 19| < 9|x2 — 3]. So, if we have that |x — 3] < 1 and 9|r —3] < «, we 
can conclude that | f(x) —19| < ¢. Solving the inequality 9 |x — 3| < ¢ for |x — 3], 
é, 


we obtain |x — 3| < 5. Hence, we will let 6 = min{1, 5}. We can now compose the 


following logically correct proof. 


Proof. Let ¢ > 0 and let 6 = min{1, 5}. Assume that 0 < |x — 3] < 6. So in particular, 
|x — 3| < 1. It follows that |x + 5| < 9. Therefore, 


[f(2) — 19| = |2? + 22 — 15] = |e + 5||e—3| <9|2— 3] < 95 $9= =e. 


In the proof of Theorem 4.3.4, below, we will be assuming the negation of the 
definition of a limit. Our next remark clarifies the meaning of this negation. 


Remark. Let f: D > R, L € R, and let © be an accumulation point of D. Then 
lim f(x) # L, if and only if there is an ¢ > 0 such that for all 6 > 0 there is an x € D 


such that 0 < |x —c| <6 and |f(z) — L| >. 


As noted earlier, there is a close connection between the definition of continuity 
of a function f at a point c and the definition of the limit of f at c. This intimacy 
leads to our next result which provides an alternate way to interpret continuity at c, 
when c is in the domain of f and c is also an accumulation point of the domain. 


Lemma 4.3.3. Let f: D— R and let c € D is an accumulation point of D. Then f 
is continuous at c if and only if lim ey = Te): 


Proof. This is easy to see by comparing the definition of the continuity of f at c, 
Definition 4.1.1, and the definition of the limit of f at c, namely Definition 4.3.1. 


We next present an important link between the limit of functions and the limit 
of sequences. As we have proven many results about sequences, this link will allow us 
to derive theorems on the limits of functions by appealing to previously established 
results on sequences. 
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Theorem 4.3.4 (Sequential Criterion for Functional Limits). Let f: D> R, LER, 
and c be an accumulation point of D. Then lim f(x) = L if and only if for every 


sequence (s,,) of points in D \ {c}, if lim Sn = c, then lim Feet. 


Proof. Let f: D — R and let c be an accumulation point of D. We shall prove the 
above stated equivalence. 


(=). Assume that lim f(x) = L. Let (s,) be a sequence of points in D \ {c} such 
that lim Sn = c. Let € > 0. Since lim f(x) = L, there is a d > 0 such that 
for all x € D, if 0 < |x —c| < 6, then |f(z) — L| <e. (4.12) 


As lim Sn = Cc, there is an N € N such that 0 < |s, —c| < 6 for all n > N. Thus, 
(4.12) implies that |f(s,) — L| < ¢ for alln > N. Hence, lim Fn) Sd 


(<=). Assume that for every sequence (s,,) of points in D \ {c}, 
if Jim Sn, = c, then im F( Se) = de (4.13) 
To prove that lim f(x) = L, assume to the contrary that L is not a limit of f at c. 
Thus, there is an ¢ > 0 such that 
for all 6 > 0, there is an x € D such that 0 < |x —c| < 6 and |f(x) —L| >. 


So for each n € N, there is an s, € D such that 0 < |s, — c| < 4 and |f(s,) — L| >. 
Therefore, (s,) is such that (i) 8, € D, (ii) 0 < |sn — ¢| < + and (a) |f (sn) — L| ><, 
for all n EN. It follows, from (i) and (ii), that 


(Sn) is a sequence of points in D \ {c} such that lim Sn = ¢. (4.14) 


However, (4.13) and (4.14) imply that lim f (Sn) = L. This contradicts (a). 


The following corollary offers a method for showing that a functional limit does 
not exist. This method avoids having to work with the negation of the limit definition. 


Corollary 4.3.5. Let f: D — R and let c be an accumulation point of D. If there 
is a sequence (s,,) that converges to c such that (f(s,)) does not converge, then the 
limit of f at c does not exist. 


The proof of Theorem 4.3.4 can easily be modified (see the proof of Theorem 3.5.4) 
to derive our next corollary which will be applied in Chapter 5. 


Corollary 4.3.6. Let f: D— R, L € R, and c be an accumulation point of D. Then 
lim f(x) = L if and only if for every sequence (s,,) of distinct points in D \ {c}, if 


dim, Sn, = c, then Jim, fey) =f, 


In the following theorem, we record the basic algebraic properties of functional 
limits. For example, we will show that if f and g have limits at c, then so does the 
sum function (f + g). In the proof of this theorem, we use Theorem 4.3.4 and some 
relevant results about sequences. 


Continuity M 115 


Theorem 4.3.7. Suppose that lim J (ae): =-1-and lim g(x) = M, where f: D> R 
and g: D > R are functions and c is an accumulation point of D. Then 
(1) lim(f + 9)(w) = b+ M. 
(2) pl Fa)(a )= LM. 
(3 li umn (k: f)(z) = kL, where k € R is any constant. 
) If 


(4 
Proof. Let f: D > R and g: D > R be functions and let c be an accumulation 
point of D. Assume lim f(z) = L and lim g(x) = M. We shall only prove (1) and 
leave (2)—(4) for Exercise 7. To prove that lim (f +g)(x) = L+M, let (s,) be an 
arbitrary sequence of points in D \ {c} that converges to c. Since lim f(a) =P and 
lim g(x) = M, Theorem 4.3.4 implies that 


g(x) £0 for alla € Dand M £0, then lim (£ \(2) = 


Jim f (Sn) = L and Jim g(Sn) = M. (4.15) 
Thus, 
lim (f +9)(Sn) = lim [f (Sn) + 9(Sn)] by Definition 4.1.4(1) 


= lim f(Sn) + lim g(Sn) by Theorem 3.2.2 
=L+M by (4.15). 


Theorem 4.3.4 now implies lim (f + 9)(4) =L+M. 
We now present a squeeze theorem for functions. This theorem is analogous to 

Theorem 3.2.8, the squeeze theorem for sequences (see Exercise 8). 

Theorem 4.3.8 (Squeeze Theorem). Let f,g,h be functions from D to R such that 

f(x) < g(a) < h(x), for all « € D. 

Let c be an accumulation point of D. If lim fia l= lim h(x), then lim Go =. 
Theorems 4.2.1 and 4.3.4 imply the following result (see Exercise 9) on the limit 

of a composition. 


Theorem 4.3.9. Let g: J — I and h: I > R, where J and J are intervals. Suppose 
that g is continuous at c € J. If ea ' h(y) = L, then lim Rote) = Te 
y>g(c xz-7C 


One-Sided Limits 


Let f: J — R be a function where J is an interval. Let c be an accumulation point 
of I. Thus, Definition 4.3.1 precisely defines the limit notation lim f(x) = L. However, 
there are times when we need to emphasize how x approaches c in the limit process. 
As cis an accumulation point of the interval J, it follows that c is either an interior 
point or an endpoint of J. We now introduce the notion of a one-sided limit. 
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Definition 4.3.10. Let f: J — R be a function where J is an interval, and let c be 
an accumulation point of I. 


(1) Right-Hand Limit: Suppose that c is an interior point or a left endpoint of J. We 
write lim, f(x) = L if and only if for every ¢ > 0, there is a 6 > 0 such that for 
rc 
alla eI, if0 < |x—c| <6 and c<zg, then |f(z) — D| <e. 


(2) Left-Hand Limit: Suppose that c is an interior point or a right endpoint of J. 
We write lim f(x) = L if and only if for every ¢ > 0, there is a 6 > 0 such that 
M i 2 Om 


for all x € I, if 0 < |x —c| < 6 and z <c, then |f(x) — L| <«. 


Remark 4.3.11. Let ¢ be an interior point or an endpoint of an interval I. Now let 
f: 1 > R be a function. Corollary 4.3.6 implies the following: 


NE 


of distinct points in J where 


=f. 


(1) lim, f(x) = L if and only if for every sequence (s7, 
wr—->>C 


Na 


C < S for alln EN, if lim s, =c, then lim f(sp 
noo noo 


nN 


of distinct points in J where 


=f. 


(2) lim f(x) = Lif and only if for every sequence (5, 
LCT 


ae 


Sn <cfor alln EN, if lim s, =c, then lim f(sp 
noo noo 


Let f: J — R bea function on an interval J and let c be a left endpoint of J. In this 
case, Definition 4.3.1 and Definition 4.3.10(1) are equivalent, that is, lim ae) Sk 


if and only if lim, f(x) = L. The analogous equivalence also holds when c is a right 
wr—->C 


endpoint of J. However, this is not quite the case when c is an interior point. 

The next theorem shows that in order for the limit of a function to exist at an 
interior point, both the limit from the right and the limit from the left must exist 
and produce the same value. 


Theorem 4.3.12. Let f: J — R, L € R, and let c be an interior point of J, an 
interval. Then lim f(x) = L if and only if lim f(x) = lim, Fla): = te 
xc LC xL—->C 


Infinite Limits 
A function f that is not defined at a point c may have values f(x) that become very 


large as x gets very close to c. 


Definition 4.3.13. Let f: D — R and let c be an accumulation point of D. We 
shall write lim f(x) = oo if and only if for every M > 0 there exists a 6 > 0 such 


that for all c € D, if 0 < |x —c| < 6, then f(x) > M. 


Notice that lim 1/2? = co. However, lim, 1/x = cand lim 1/x% = —oo (see Ex- 
=> 20 r—0- 
ercise 23(b)(c)). It follows that lim 1/x is meaningless in the sense of Definition 4.3.13. 
coe 
The definition of lim f(x) = —oo is similar to Definition 4.3.13, and will be left to 


the reader to formulate (see Exercise 23). 
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Limits at Infinity 


On the other hand, as x gets very large, the values f(a) may become very close to a 
real number, or the values f(z) may become very large. 


Definition 4.3.14. Let f: (a,oo) —~ Rand LER. Then lim f(x) = L if and only 


if for every € < 0 there exists a M > a such that for all x € (a,oo), if x > M, then 
f(x) —L] <e. 


Definition 4.3.15. Let f: (a,co) > R. Then lim f(x) = oo if and only if for every 
K > 0 there exists a M >a such that for all x € (a,0o), if x > M, then f(x) > K. 


Exercises 4.3 


1. 


Use Definition 4.3.1 to prove each of the limits. 
(a) lim 2? —-3x2+1=11 

rd 
(b) lim 2?+2r+7=7. 

r——2 


. Let f: D— R be a function and let c € D. Suppose that c is an accumulation 


point of D. Prove that f is continuous at c if and only if lim Caw ares 


. Let f: R > R be defined by 


aati ifxEeQ 
Ae ey ifcxéQ. 


Show that for every c € R the limit of f at c does not exist. 


. Let f: D — Rand let c be an accumulation point of D. Prove that if lim Cha oe 


where L is a real number, then there is a 6 > 0 and an M > 0 such that 
| f(a)| <M whenever 0 < |ja—c| <d and ze D. 


. Let f: D— R and let c be an accumulation point of D where lim Ft) da 


and L is a real number. Prove that there is an ¢ > 0 and a 6 > 0 such that for 
all x € D, if 0 < |x —c| < 6, then |f(x)| > «. 


. Let f: D— R and let c be an accumulation point of D where lim Jia] LS0 


and L is a real number. Prove that there is a 6 > 0 such that f(x) > 0 when 
0<|a—c|<dandze D. 


. Complete the proof of Theorem 4.3.7, that is, prove items (2)—(4) of this theorem. 
. Prove Theorem 4.3.8. 

. Prove Theorem 4.3.9. 

. Let f: DR, c be an accumulation point of D, and lim f(z) =LeER. 


(a) Suppose that f(x) > 0 for all x € D. Prove that L > 0. 
(b) Suppose that f(x) < 0 for all x € D. Prove that L <0. 
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“11. 


12. 


13. 


14. 


15. 
16. 
"17. 


18. 


*19. 


20. 


21. 


22. 


*23. 


24. 


Let f: J > R, were J is an interval, and let a € J and b € I be such that a < b. 
Suppose that lim fay < lim f(a). Prove that there exists c,d € I such that 
wa x«w—> 


a<c<d< band f(c) <0< f(d). 


Let f: J > R be increasing on an interval J. Let c € I be an interior point. Let 
S={f(z):a2€lI and « < c}. Prove that 8 = sup(S) exists and lim f(x) = 8. 
LCT 


Let f: J > R be increasing on an interval J. Let c € I be an interior point. Let 
T={f(x):2¢€landc< cz}. Prove that a = inf(T) exists and lim, f(z) =a. 
zw->C 


Let f: J > R, were J is an interval. Let c be an interior point of J. Suppose that 
f is increasing on J. Prove that lim f(x) < f(o) < lim, F(S). 
H oe gO w—->C 


Prove Corollary 4.3.6. 
Prove Theorem 4.3.12. 


Let f: D— Rand let c be an accumulation point of D. Prove that lim $e). = 60 
zc 
if and only if for every sequence (s,) of points in D \ {c}, if lim Sn = c, then 
n [e.e) 
eine aay, 
Let f: D — R, c bean accumulation point of D, and (s,,) be a sequence of points 


in D that converges to c. Prove that if lim f(x) = oo, then for every < > 0, there 
x Cc 


is an N € N such that 755 <€ for alln > N. 


Let f: (a,00) > R and LER. Prove that lim f(x) = L if and only if for every 


sequence (s,,) of distinct points in (a, oo), if Jim Sn, = 00, then Jim Csi ee 


Let f: (a,co) > R. Prove that lim f(x) = oo if and only if for every sequence 


(Sn) of points in (a, co), if lim Sn = 00, then lim $USin.) = Oo: 
Let h: R > R be a function and let a € R. Suppose that lim, hie) =. 
wa 


(a) Prove that if h is odd, then lim h(x) =—L. 


wr —-a 


(b) Prove that if h is even, then lim h(x) = L. 


rl —-aT” 
Let f: D> Rand g: D->R, and let c be an accumulation point of D. Suppose 
that there is an M > 0 such that |g(x)| < M for all x € D, and lim Ter) St 


Prove that lim (fg) (x) =0. 


Provide definitions for the following limit concepts: 


(a) lim f(e)=-00. (0) lim f(@)=-00. —(@) Jim fe) = 00. 
(b) lim f(z)=00. (a) lim f(@)=E. —(f) lim f(r) =00. 


Let f: (—oo,b) > R. So, the composition f(—2) is defined on (—8, 00). 
that jim f(x) = L. Show that lim f(—ax) = L. 
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Exercise Notes: For Exercise 5, let ¢ = A For Exercise 6, let ¢ = L. For Exercise 8, 
use Theorem 4.3.4 and Theorem 3.2.8. For Exercises 10, use Theorem 4.3.4 and 
Theorem 3.2.14. For Exercise 11, review the proof of Lemma 4.1.14. For Exercise 21, 
apply Remark 4.3.11. In particular, for (a), let (s,) be a sequence of points in R such 


that s, < —a for all n € N. Suppose that lim Sn = —a. Show that lim h(Sn) = —L. 


4.4 CONSEQUENCES OF CONTINUITY 


In this section we establish several important properties that continuous functions 
possess. First we review some relevant features that have been shown to hold for 
sequences and continuous functions. 


Concept Questions: Let f: [a,b] > R be continuous and let (7,,) be a sequence of 
points such that x, € [a,b] for all n > 1. 


1. What can you conclude about the sequence (x,) from Theorem 3.5.1? Answer: 
(Zn) has a convergent subsequence. 


2. If the sequence (f(x,,)) converges, what can you conclude from Theorem 3.1.24? 


Answer: (f(2,)) is bounded. 


3. If (ap) has a subsequence (2,,) that converges to c. What can you infer about c 
from Corollary 3.2.15? Answer: c € [a,b]. What can you infer from Theorem 4.2.1 
about the sequence (f(x, ))? Answer: jim iia Hae): 

00 


4. Ifa<c< band f(c) > 0, then what can you infer from Lemma 4.1.14(a)? 
5. Ifa<c< band f(c) < 0, then what can you infer from Lemma 4.1.14(b)? 


Recall that a function f: [a,b] — R is bounded if and only if there is a real 
number M > 0 such that | f(x)| < M for all x € [a,b] (see Remark 2.3.15). 


Theorem 4.4.1. If f: [a,b] > R is continuous, then f is bounded. 


Proof. Let f: [a,b] + R be continuous. Suppose that f is unbounded. Thus, for all 
n EN, there exists an x, € [a,b] such that | f(v,)| > n. So the sequence (f(xn)) is 
unbounded and (A) any subsequence of (f(a,,)) is unbounded. Consider the sequence 
(tn) of points in [a,b]. Theorem 3.5.1 and Corollary 3.2.15 imply that (x,) has a 
subsequence (X,,) that converges to a point c € [a,b]. Theorem 4.2.1 implies that 
im f(tn,) = f(c). By Theorem 3.1.24 (f(ap,)) is bounded, contradicting (A). 


4.4.1 The Extreme Value Theorem 


The extreme value theorem states that if a function is continuous on a closed interval, 
then the function attains maximum and minimum values. This theorem is usually 
stated without proof in most calculus books. In this section, we shall prove this 
important result. 


Definition 4.4.2. Let f: D— R be a function. We say that 


120 m Real Analysis: With Proof Strategies 


e f attains a maximum value on D if there is c € D such that f(x) < f(c) for 
all x € D, that is, sup(f|D]) = f(c) for some c € D; 


e f attains a minimum value on D if there is d € D such that f(d) < f(x) for 
all x € D, that is, inf(f|D]) = f(d) for some d € D. 


We will next show that a continuous function, whose domain is a closed and 
bounded interval, attains both a maximum and minimum value. 


Theorem 4.4.3 (Extreme Value Theorem). If f: [a,b] + R is continuous, then f 
attains maximum and minimum values on [a, 0]. 


Proof. Let f: [a,b] + R be continuous. By Theorem 4.4.1, the set f([a, ]) is bounded. 
Let M = sup(f((a,])). For each n EN, let x, € [a,b] be such that M — + < f(a). 
Thus, M — f(an) < 4 for all n > 1. Since f(x) < M for all x in [a, 6], we see that 
|f(@n) — M| < + for every n > 1. Thus, (A) Jim f (tn) = M (see Theorem 3.1.11). 

By Theorem 3.5.1 and Corollary 3.2.15, the sequence (x7) has a subsequence (an, ) 
that converges to a point c € [a,b]. Since f is continuous, Theorem 4.2.1 implies that 


lini Fi) = 7 eo) (4.16) 


k- oo 


Moreover, (f(2n,,)) is a subsequence of (f(x,)). Theorem 3.3.4 and (A) thus imply 

that jim f(@n,) = M. So, from (4.16), we have f(c) = M and therefore, f attains a 
00 

maximum value on [a, 6]. A similar argument, using m = inf(f([a, b])), shows that f 

attains a minimum value on [a, J. 


The conclusion of Theorem 4.4.3 may not hold if the closed interval [a, b] is replace 
by an open interval. For example, let f: (0,1) — R be the continuous function defined 
by f(x) =x. The function f does not attain a maximum value on (0, 1). 


4.4.2 The Intermediate Value Theorem 


The Intermediate Value Theorem proclaims that if a continuous function, on an 
interval, attains two distinct values, then it also attains each value strictly between 
these two values. Before proving this theorem, we will first establish a key lemma 
which has a simple geometric interpretation. Let f: J — R be continuous where [ 
is an interval. Let a,b € I be such that a < b. Now suppose that f(a) < 0 < f(b). 
Since f is continuous, the graph of f moves from the point (a, f(a)) to the point 
(b, f(b)) without any breaks. So the graph must cross the x-axis at a point c between 
the points a and b. So f(c) = 0 (see Figure 4.2). In spite of its apparent simplicity, 
the proof of this lemma requires the completeness axiom. 


Lemma 4.4.4 (Intermediate Root Lemma). Let f: J — R be continuous where I 
is an interval. Let a,b € I be such that a < b. If f(a) < 0 < f(b), then there is a 
c € (a,b) such that f(c) = 0. 


Proof. Let f: I + R be continuous, a,b € I be such that a < b, and f(a) <0 < f(b). 
Let S = {x € [a,b] : f(x) < O}. As f(a) < 0, the set S is nonempty. Since a < x < b 
for all x € S, we conclude that S is bounded, c = sup(S) exists, and c € [a, 0]. 
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Figure 4.2: Illustration for Lemma 4.4.4. 


Claim. f(c) = 0. 


Proof of Claim. Suppose, for a contradiction, that either f(c) > 0 or f(c) < 0. 
CASE 1: f(c) > 0. Lemma 4.1.14(a) implies that there is a 6 > 0 such that 


f(x) > 0 for all x € [a,b] satisfying |x — cl < 6. (4.17) 


Since c = sup(.S) and c— 6 < , there is an x € S such that c—d <xanda<c. 
Thus, x € [a,b] and |x —c| < 6. So f(x) > 0 by (4.17). Since x € S, we also have 
that f(x) < 0. This contradiction implies that f(c) # 0. 


CASE 2: f(c) < 0. Lemma 4.1.14(b) implies that there is a 6 > 0 such that 
f(x) <0 for all x € [a,b] satisfying |x — cl < 6. (4.18) 


Asc <b, f(c) < 0, and 0 < f(b), we have that c < b. Thus, c < min{b,c+ 6}. Let 
x be such that c < x < min{b,c+ 6}. Thus, x € [a,b] and |x — c| < 6. Hence, (4.18) 
implies that f(x) < 0. So x € S. Since c < z, this contradicts the fact that c is an 
upper bound for S. Thus, f(c) ¢ 0. 


Therefore, f(c) = 0 and this completes the proof of the claim. 


By the Claim, f(c) = 0 where c € [a,6]. Since f(a) < 0 < f(b), we must have 
that a < c < b and thus, c € (a,b). The proof of the lemma is now complete. 


For a simple application of Lemma 4.4.4, let f: [2,3] — R be the continuous 
function defined by f(x) = 2° — 32x — 3. Since f(2) < 0 < f(3). There is ac € (2,3) 
such that f(c) = 0. 

The following definition resembles the statement of the Lemma 4.4.4, except that 
we allow f(a) and f(b) to be any two distinct values—not just one negative and the 
other positive (see Figure 4.3). 


Definition 4.4.5. Let J be interval. A function f: J > R has the intermediate 
value property on I if the following condition holds: Whenever a,b € J are such 
that a < band k is any real number strictly between f(a) and f(b), there is a c € (a, b) 
such that f(c) = k. 


The following theorem generalizes Lemma 4.4.4. This generalization will be ap- 
plied in this section and in later sections of the text. 
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Figure 4.3: Illustration for Theorem 4.4.6. 


Theorem 4.4.6 (Intermediate Value Theorem). Let J be an interval. If f: [+R 
is continuous, then f has the intermediate value property on J. 


Proof. Let f: I — R be continuous and let a,b € I be such that a < b. Lettk ER 
be strictly between f(a) and f(b). If f(a) < k < f(b), then let g: I — R be defined 
by g(x) = f(x) —k. Because f is continuous, it follows that g is continuous. Since 
g(a) = f(a) -—k <0 < f(b) —k = g(b), Lemma 4.4.4 implies that g(c) = 0 for some 
c € (a,b). Hence, g(c) = f(c) — k = 0 and therefore, f(c) = k. If f(a) > k > f(b), 


then a similar argument applies using the function g(x) = k — f(z). 


The Intermediate Value Theorem is an important theoretical tool that is used to 
prove a number of results in calculus and in real analysis. For example, it can be used 
to show that every positive real number has a positive nth root. To see this, let k > 0 
and let n € N. Let f: [0,k+1] — R be the continuous function defined by f(x) = x”. 
As f(0) <k < f(k+1), Theorem 4.4.6 implies that there is a c € (0,k+1) such that 
f(c) =c" =k. Thus, c is the nth root of k. 

One may suspect that the converse of Theorem 4.4.6 holds; that is, if a function 
has the intermediate property, then it must be continuous. However, this is false. Let 


f: [0,1] + R be defined by f(x) = sin(4) for x # 0 and f(0) = 0. Then f has the 
intermediate value property on [0,1], but it is not continuous at 0 (see Exercise 13). 
The Intermediate Value Theorem implies that the continuous image of an interval 


is an interval. 


Theorem 4.4.7. Let J C R be an interval. Suppose that f: J — R is continuous 
and nonconstant. Then f{[J] is an interval. 


Proof. This follows from Theorem 4.4.6 and Definition 1.1.1 on page 2. 


Our next theorem shows that the continuous image of a closed interval is a closed 
interval. 


Theorem 4.4.8. Let f: [a,b] + R be continuous and nonconstant. Then there exists 
real numbers m < M such that f([a,6]) = [m, /]. 
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Proof. Let f: [a,b] — R be continuous and nonconstant. By Theorem 4.4.3 and 
Definition 4.4.2, there are c,d € [a,b] such that 


f(c) = M =sup(f([a, 4))) and f(d) = m = inf(f([a, }])). 


So m,M € f({a,b]). Since f is a nonconstant function, m < M. Asm < f(r) < M 
for all x € [a,}], it follows that f([a,b]) C [m,M]. Moreover, Theorem 4.4.7 implies 
that f((a,6]) is an interval. As m,M € f({a,0]), it follows that [m,M] C f({a, 6). 
Hence, f([a, 6]) = [m, M]. 


We end this section with another application of Lemma 4.4.4. 


Theorem 4.4.9 (Fixed Point Theorem). Let f: [a,b] — [a,b] be continuous. Then 
there is a point c in [a,b] such that f(c) =. 


Proof. If f(a) = a or f(b) = 6, then the result holds. Suppose that f(a) 4 a and 
f(b) Z b. Since f: [a,b] — [a,b], it follows that a < f(a) and f(b) < 6. Now consider 
the continuous function g: [a,b] + R defined by g(x) = x— f(x). Since a < f(a) and 
f(b) < b, we see that g(a) < 0 < g(b). Lemma 4.4.4 implies that there is a point c in 
[a, 6] such that g(c) = 0. Thus, f(c) =. 


Exercises 4.4 


1. Prove Theorem 4.4.7. 


2. Let f: [a,b] > R be continuous. Suppose that inf(f([a, }])) < k < sup(f (a, d])). 
Show that there exists an x € [a, 6] such that f(x) =k. 


3. Give an example of a continuous function on an open interval that is not bounded. 


4. Let T= {x €Q:1 <a < 3}. T is called a closed Q-interval (see page 51). 
Define f: Z > Q by f(x) = x? — 2. Observe that f(1) < 0 < f(3). Show that 
there is no c € Z such that f(c) = 0. 


5. Let f: J > R where J is an interval. Suppose that for all a,b € J, if a < b, then 
f([a, b]) is an interval. Prove that f has the intermediate value property on I. 


6. Given that f(x) = sin(z) is a continuous function, show that there is an open 
interval whose image under f is not an open interval. 


7. Consider the function f: [0,2] > R defined by f(x) = 24 — 3x? + 1. Show that 
there are distinct points x,y € [0,2] such that f(x) =0 and f(y) =0. 


8. Complete the proof of Theorem 4.4.3; that is, prove that f attains a minimum 
value on [a, }]. 


9. Let f: [0,1] — R be a continuous function. Suppose that f(0) = f(1). Prove 
that there is a c in [0, 4] such that f(c) = f(c+ $). 


10. Let f: [a,b] > R be a continuous function. Suppose that f(x) € Q, for all x in 
[a, 6]. Prove that f is constant on |a, }]. 
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11. Let f: [a,b] > Rand g: [a,b] — R be continuous. Suppose that f(a) < g(a) and 
f(b) > g(b). Prove that there is a point c in (a,b) such that f(c) = g(c). 


12. Show that there exists an x € (0,7/2) such that cos(x) = a. 
*13. Let f: [0,1] — R be defined by 


_ fsin(4), if 40; 
es i if x =0. 


Consider the sequence (xa 73 : Show that f is not continuous at 0. 


Exercise Notes: For Exercise 8, since m = inf(f([a,b])), we see that for all n EN, 
there is an x, € [a,b] such that f(z,) < m+ 4. For Exercise 9, consider the function 
h: [0,3] + R defined by h(x) = f(x) — f(x + 4). Explain why h is continuous. 


4.5 UNIFORM CONTINUITY 


Recalling Definition 4.1.1, a function f: D — R is continuous on D when for each 
c € D and for every € > 0, there exists a 6 > 0 such that for all x € D, if |x —c| <6, 
then | f(a) — f(c)| < ¢. Given two points c,d € D and an ¢ > 0, the 6 that you find 
to establish the continuity of f at c may not work to establish the continuity of f 
at d. That is, one may have to find a 6’ 4 6 which will show that f is continuous 
at d. When you can find a 6 that works for all the points in D simultaneously, we 
shall say the function f is uniformly continuous.” 

Definition 4.5.1. A function f: D —> R is uniformly continuous on D when the 
following holds: For every ¢ > 0 there exists a 6 > 0 such that for all z,c € D, if 
|x —c| < 6, then |f(x) — f(c)| <e. 


Proof Strategy 4.5.2. Let f: D—R. To prove that f is uniformly continuous on 
D, use the proof diagram: 


Let ¢ > 0 be an arbitrary real number. 
Let 6 = (the positive value you found). 
Let « € Dandc€ D be arbitrary. 
Assume |x — c| < 6. 


Prove |f(x) — f(c)| <e. 


Example 4.5.3. Let f: R— R be defined by f(x) = 2x. Prove that f is uniformly 
continuous on R. 


Solution. To prove that f is uniformly continuous on R, we need to first do some 
“scratch work.” Let ¢ > 0. We must find a 6 that verifies the continuity of f at all 
points c € R simultaneously, that is, we must find a 6 such that for all c,x € R, if 


?The word uniform means: not changing in form or character; remaining the same in all cases 
and at all times. 
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|x —c| < 6, then |f(x) — f(c)| < ¢. Recall the strategy to prove continuity of f at c. 
Working with | f(x) — f(c)| we try to “extract out” |x —c|. Let us try this strategy 
for the function f(x) = 2x. 


|f(@) — fle)| = |2a — 2c| = 2|a — ¢|. 


By solving the inequality 2 |x — ¢| < e for |x — c|, we obtain |x — c| < §. Using 6 = §, 
we can prove that f is continuous at c. Notice that the 6 we found does not depend 


on c. Hence, we can also use this 6 to prove that f is uniformly continuous on R. 


Proof. Let f: R — R be defined by f(x) = 22. Let ¢ > 0. Take 6 = §. Let c,x € 
R and assume that |x —c| < 6. Thus, |f(z) — f(od)| = 2|e¢-—c| < 26 = 2& =e. 
Therefore, f is uniformly continuous on R. 


& 
2 


Let f: D > R and let c € D. Suppose that you can prove that f is continuous 
at c using Definition 4.1.1. In addition, suppose that the 6 you obtain depends on c. 
As a consequence, this 6 will work for c; but it may not work for all points in D. In 
this case, the function f may not be uniformly continuous on D. 

To prove that a function is not uniformly continuous can be a bit challenging 
because one has to establish the negation of Definition 4.5.1. We provide such a proof 
in our next example. To prepare us for this proof, we state the negation of uniform 
continuity in the following comment. 


Comment. Let f: D— R. Then f is not uniformly continuous on D if and only if 
there exists an € > 0 such that for all 6 > 0, there are x,c € D such that |x —c| <6 


and | f(x) — f(c)| 2 €. 


Example 4.5.4. Let f: R > R be defined by f(x) = x?. Prove that f is not 
uniformly continuous on R. 


Proof Analysis. We will apply the above comment. We need to first identify ¢. Let us 
try « = 1. Now, let 6 > 0 be arbitrary. We need to find c,x € R such that |x — c| < 6 
and | f(x) — f(c)| = |v +e||x —c¢| > ¢ =1. To make sure that |x — c| < 4, let us take 
(x) c =c+ 2 with c > 0. Hence, |x — c| = §. To make |x + ¢||x — c| > 1 hold, we 
need (after substituting) |a + c|% > 1; that is, we need |x + c| > 2. Since 2,¢ > 0 we 


6 
have that |x + c| > |c|. So, any c that satisfies |c| > = will do. So, we let c = 3 and 
x = 3 + 4 (see (*)). We can now prove that f is not uniformly continuous on R. 
Proof. Let f: IR — R be defined by f(x) = x”. Let ¢ = 1. Let 5 > 0. Let c= 2 and 
c=o+ a Clearly, |x — c| = 4 <6 and 
4 6\0 67 
Fo) =f) =|z+dle-d= (545) 5-24 75> 


Thus, |2 — c| < 6 and |f(a) — f(c)| > 1. Therefore, f is not uniformly continuous. 


We see from Example 4.5.4 that there are continuous functions that are not 
uniformly continuous on certain intervals. Nevertheless, there are conditions that 
will ensure that a continuous function is uniformly continuous. The following theorem 
presents one such condition: if the domain is a closed and bounded interval. 
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Theorem 4.5.5. Let a,b € R where a < b. If f: [a,b] — R is continuous, then f is 
uniformly continuous on |{a, }]. 


Proof. Let f: [a,b] + R be continuous. Suppose, for a contradiction, that f is not 
uniformly continuous on [a,b]. Thus, there exists an ¢ > 0 such that for all 6 > 0, 
there are xz,c € [a,b] such that |x —c| < 6 and |f(x) — f(c)| > e. Hence, for every 
natural number n > 1, there are points x, c, in [a,b] such that 


ee ns SIO es, (4.19) 


Since the sequence (z,,) consists of points in [a, 6], Theorem 3.5.1 and Corollary 3.2.15 
imply that (a) has a subsequence (7,,) such that jim In, = ¢ where c € [a,b]. 
00 


Consider the corresponding subsequence (¢,,,) of (Cn). By Lemma 3.3.3 and (4.19), 
we see that |%n, — Cn,| < x < i for allk > 1.So0 jim Cn, = c by Lemma 3.2.11. As f 


is continuous, Theorem 4.2.1 implies that jim fe) = f(c) and jim Hew) = fo: 
Thus, | f(@n,) — f(e)| < 5 and |f(cn,) — f(o)| < 5 for some k > 1. Therefore, 
f(an) — FlCna)| = [Fn,) — Fl) + FO) — Feng) 
S |F(tn,) — FO] + [flen) — FOL < 


So |f(2n,) — f(en,)| < €. However, (4.19) implies that |f(tn,) — f(cn,)| = ¢. This 
contradiction completes the proof. 


Let f: D > R be a continuous function. Theorem 4.2.1 implies that for any 
sequence (x7) of points in D, if (v,) converges to a point c € D, then the sequence 
(f(%n)) converges to f(c). However, if c ¢ D, then the sequence (f(x,)) may not 
converge. For example, let D = (0,2) and define f: (0,2) — R by f(x) = 2. The 
function f is continuous. Now consider the sequence of points in (0,2) given by (+) 
for all n € N. The sequence (+) converges to 0 while the sequence (f(+)) = (n) does 
not converge. The next theorem shows that this cannot occur for a function that is 
uniformly continuous. 


Theorem 4.5.6. Suppose that f: D — R is uniformly continuous. Let (xz,) be any 
sequence where x, € D for all n > 1. If (x,) converges, then (f(x,)) converges. 


Proof. Suppose that f: D — R is uniformly continuous. Let (x,) be a convergent 
sequence of points in D. By Theorem 3.6.5, (x) is a Cauchy sequence. We will 
now prove that (f(x,)) is also a Cauchy sequence. Let ¢ > 0. Since f is uniformly 
continuous, there is a 6 > 0 such that 


for all x,c € D, if |x — c| < 46, then |f(z) — f(o)| <«. (4.20) 


As (X) is a Cauchy sequence, there is an N such that |v, — x»,| < 6 for alln,m > N. 
Thus, (4.20) implies that |f(v,) — f(@m)| < ¢ for all n,m > N. Therefore, (f(2n)) 
is a Cauchy sequence. Theorem 3.6.5 now implies that (f(x,,)) converges. 


Definition 4.5.7. A function f: D > R preserves convergent sequences if for 
all convergent sequences (z,,) with points in D, we have that (f(x,)) converges. 
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Thus, by Theorem 4.5.6, each uniformly continuous function preserves convergent 
sequences. The converse also holds when the domain of the function is bounded. 
Before we record this in the form of a theorem, we present a supportive lemma. 


Lemma 4.5.8. Suppose that f: D — R preserves convergent sequences and that 

(Zn) and (cp) are convergent sequences of points in D. If lim z, = lim cp, then 
: . Noo nN—- Oo 

ee aes er 

Proof. Suppose that f: D — R preserves convergent sequences and assume that 

Jim iw Jim Cn, Where (%,) and (cy) are two sequences of points from D. By 


Proposition 3.3.9, the sequence (21, C1, £2, C2, ©3,C3,...) converges. Since f preserves 
convergent sequences, we conclude that sequence 


(f(x1), f(a), f (x2), f(e2), f (x3), f(es), as -) 


converges to a number L. Thus, lim He =a LS lim f(cn), by Theorem 3.3.4. 


Theorem 4.5.9. Suppose that f: D — R preserves convergent sequences. If D is 
bounded, then f is uniformly continuous. 


Proof. See Exercise 8. 


Exercises 4.5 
1. Let m and c be real numbers where m 4 0. Consider the function f: R > R 
defined by f(x) = ma +c. Prove that f is uniformly continuous. 


2. Find a function f: D > R such that f is uniformly continuous, but f? is not 
uniformly continuous. [Thus, the product of uniformly continuous functions is 
not necessarily uniformly continuous. | 


3. Prove that f: IR > R defined by f(x) = x? is not uniformly continuous on R. 


4. Let f: D— Rand M > Obesuch that | f(x) — f(y)| < M |x —y|forallz,y € D. 
Prove that f is uniformly continuous. 


5. Suppose that f: D —> R is uniformly continuous and let & € R be nonzero. Prove 
that kf is uniformly continuous. 


6. Suppose that f: D — R and g: D > R are uniformly continuous. Prove that 
(f +9): D> R is uniformly continuous. 


7. Suppose that f: D — R is uniformly continuous. Let g: D — R be defined by 
g(x) =|f(x)|, for all x € D. Prove that g is uniformly continuous. 


*8. Prove Theorem 4.5.9. 


9. Let f: D > R be uniformly continuous and (x,) be a sequence of points in D. 
Prove that if (w,) is a Cauchy sequence, then (f(x,)) is Cauchy sequence. 


10. Prove that if f: D— R preserves convergent sequences, then f is continuous. 
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11. 


12. 


13. 


14. 


15. 


Let f: D— Rand g: E > R be functions such that f|D] C E. Prove that if f 
and g are uniformly continuous, then (go f): D > R is uniformly continuous. 


Let f: D> Rand g: D > R be uniformly continuous, where D is bounded. 
Using Theorems 4.5.6 and 4.5.9, prove that (fg): D — R is uniformly continuous. 


Let f: D > R be uniformly continuous. Suppose that y > 0 such that f(x) > y 
for all « € D. Prove that (F): D — Ris uniformly continuous. 


Suppose that f: D — R is uniformly continuous and D is bounded. Prove that 
f is bounded, using Theorem 4.5.6. 


Let f: D > R be uniformly continuous and let c ¢ D be an accumulation point 

of D. 

(a) Let (a) be any sequence of points in D that converges to c. Show that 
(f(%n)) converges. 

(b) Let (zn) and (yn) be sequences of points in D that both converge to c. Show 
that lim f(zn) = lim f(Yn). 

(c) Conclude that lim f(x) exists. 


Exercise Notes: For Exercise 1, review Example 4.5.3. For Exercise 3, review Ex- 


ample 4.5.4 and note that las -—c 


3) = |x? + xc + c?| |v —c|. For Exercise 6, review 


the proof of Theorem 4.1.6. For Exercise 8, modify the proof of Theorem 4.5.5 by 
appealing to Lemma 4.5.8. For Exercise 10, apply Lemma 4.5.8 and Theorem 4.2.1. 
For Exercise 11, modify the proof of Theorem 4.1.13. 


CHAPTER 5 


Differentiation 


An important foundational concept in calculus with respect to a function f is the 
quotient of two differences f@)- fe) called the difference quotient of f between 
the real numbers x and c. Of course, if « = c, then this difference quotient yields the 
meaningless ratio 8. So, in a difference quotient, we assume that the real numbers x 
and care distinct. As you may recall from calculus, the difference quotient is used to 
find the slope of a secant line to the graph of f, where the secant is the line joining 
the distinct points (c, f(c)) and (a, f(x)). This slope also identifies the average rate 
of change of f between c and x. By taking the limit of the difference quotient as 
x approaches c, we arrive at a mathematical precise definition of the slope of the 
tangent line to f at the point c (see Figure 5.1). 


slope = lim, Ma) — Lo) 


—¢ 
a 


(ce, fle ~ 
Cy oe 


Figure 5.1: Limit of difference quotients. 


In this chapter, we will focus on functions whose domain is an interval. For such 
functions, we shall present a theoretical treatment of the derivative and explore some 
of its properties. We also identify and establish three significant results that concern 
the derivative: The Mean Value Theorem, L’H6pital’s Rule, and Taylor’s Theorem. 


5.1. THE DERIVATIVE 


In this section, our attention will be on the definition of the derivative. We will present 
some formulas and procedures for finding derivatives. The key tool for establishing 
these procedures involves the algebraic manipulation of difference quotients. 
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Definition 5.1.1. Let f: J > R be a function where J is an interval. Let c € I. We 
shall say that f is differentiable at c to mean that the limit 


an £2) - £0 


LC ~T—C 
exists and is a real number denoted by f’(c). So, when f is differentiable at c, we 
shall write f(x) — fle) 
DN oc id Nee 
fd = lim —— 


The value f’(c) is the derivative of f at c. Let S C I. Then f is differentiable on 
S when f is differentiable at all points in S, and the function f’: S > R is called 
the derivative of f on S. Finally, f is differentiable if f is differentiable at every 
point in J and, in this case, f’: I > R is said to be the derivative of f. 


The limit operation lim G(x) (see Definition 4.3.1) only requires that c be an 


accumulation point of the domain of G. So in Definition 5.1.1, c can be an endpoint 
of the interval J, as c is an accumulation point of I. 


Example. Let f: R — R be defined by f(x) = x? + 1. Then for any c € R we have 


Fo) = tim LOL) by definition of f’(c) 
LC ~T—C 
2 2 
agi 1 

= lim = cored) ag genaiiony 

LC ~T—C 
ah (e+ ola — 0) by algebra 

LC r—C 
— lim (x +c) =2c by algebra and Theorem 4.3.7. 


Therefore, f’(c) = 2c. 


Example. Let f: R — R be defined by f(x) = |a|. For c = 0, lim el-l0l = 4 


aot 70 


and lim ED = —1. By Theorem 4.3.12, f is not differentiable at 0. However, f is 


x0 
continuous at 0. 


The above example shows that a function may be continuous at a point without 
being differentiable at the point. So continuity does not imply differentiability. On the 
other hand, our next theorem shows that if a function is differentiable at a point, then 
it is continuous at this point. For the remainder of the text, J denotes an interval. 


Theorem 5.1.2. If f: J > R is differentiable at c € J, then f is continuous at c. 


Proof. Let f: I > R be differentiable at c where c € J. To prove that f is continuous 
atc, we will show that lim f(x) = f(c). Since f is differentiable at c, we have that 
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Notice that for « € I where x 4 c we have 


f(a) = («9 LI-L9 5 56) 
Hence, 
flo) — H0) 


)+r0 (5.1) 


by Theorem 4.3.7. Since lim (x —c) = 0, equation (5.1) implies that lim pla Fhe): 


Hence, Lemma 4.3.3 implies that f is continuous at c. 


lim f(a) = (lim (e - c) (1im 


rc wc C-—C 


Corollary 5.1.3. If f: J > R is differentiable on J, then f is continuous on I. 


5.1.1. The Rules of Differentiation 


In general, computing the derivative of a function using Definition 5.1.1 is difficult. 
Fortunately, there are rules and procedures for computing derivatives. Our next next 
theorem establishes these familiar rules of differentiation that we first learned in 
calculus. Moreover, the proofs of these rules use Definition 5.1.1 and the algebraic 
maneuvering of expressions that involve difference quotients. After the proof of this 
theorem, we will give a proof of the chain rule. Recall that J represents an interval. 


Theorem 5.1.4. Let f: J — Rand g: I > R be differentiable at c € J. Then 


(1) (Constant Multiple Rule) For k € R, the function kf is differentiable at c and 
(kf)'(c) =k: fo). 
(2) (Sum Rule) The function f + g is differentiable at c and 
(f+ 9 =f +9(0. 
(3) (Product Rule) The function fg is differentiable at c and 
(f9)'() = Fg) + 9) FO). 


(4) (Quotient Rule) If g(c) 4 0, then the function f is differentiable at c and 


AV, _ alto) ~ fle)a'(o 
(Jo Ce 


g 
Proof. Assume that (a) f: J > Rand g: I > Rare differentiable at c € I. For parts 
(1) and (2), see Exercise 3. We will just prove (3) and (4). 


PROOF OF (3). We will first start with the difference quotient Goa) Fa) and 
algebraically “extract out” the difference quotients fe) fo) and alee) For x € I 


C6 
where x 4 c, we have 
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(f(z) — (fale) _ Fle)g(2) — Fg) by Def. 4.1.4(3) 


_ LFlw)a(e) = Fo)a(0) + LF@)9(0) = FIO], sreedra 
§ (y=. 4 qo fO=LO reas 
Thus, 
(F(2) = UNO _ p¢,90) =H) | yf) = 0 


L—C Z-—C ) Z-—C 
Theorem 4.3.7 implies that 


fm (F9)(@) = (Fao) _ (lim f(x)) (1im g(«) — 19) +4 (1im f(z) - ao * 


te Gee “Zc T—C rc ~CT—C 


Since f is continuous at c, we have that lim f(x) = f(c). In addition, by (a), we have 


that lim fe) 19) = f'(c) and lim ae)—ale) = g'(c). Thus, equation ¥% yields 


L— 


fm f= (F9) (6) 


LC Z—-C 


= f(99 (9 + 9) f'(0)- 


PROOF OF (4). Assume that g(c) 4 0. Theorem 5.1.2 and Lemma 4.1.14 imply 
that there is an interval J C I with c € J such that g(x) 4 0 for all x € J. Again, 
(s)e)- GY) 


C=C 


the difference quotients £ @)t and 2 @)-9 () For z € J where « #c, we have 


L— 


(Qe- (Qo _ #- #8 


we will start with the difference quotient and algebraically “extract out” 


_ g@) 9) 

ae ¢2 f() 
(x—c©) \g(x) gle) 

a isl (Sees — flojg(x ) 
(=<) g(x) g(c) 

wel (faite — flo)g(o)] — [Flog(x) — Hoetel) 
(o=¢) g(x) g(c) 

_ 1 (eclee — f(o)] — FOlgl#) Ho) 
g(x) g(c) (x —c) 

i fa) fo g(x) — g(c) 

rage (8 eaay HOES) 


Thus, 


(7)@ - (3) _ (AER - rose 


z—¢ g(x) g() 
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Theorem 4.3.7 now implies that 


— (Hea - (Qo _ 0 (tim 22=L2) - 10) (im °=2) 
lim L—-C 7 g(c) lim g(2) ey 


f()-fle) __ fi(c) 


Since lim — and lim ale)—a(e) = g'(c) as f and g are differentiable at c, 


equation (5.2), Theorem 5.1.2, and Lemma 4.3.3 yield the equation 
£ f 
(O- HO _ deste -foa'te) 


me ee gl 


5.1.2 The Chain Rule 


The chain rule is a procedure for differentiating the composition fog of differentiable 
functions f and g. Let J and J be intervals. Suppose that f: J ~ Rand g: J TJ, 
where g is differentiable at c € J and f is differentiable at g(c). Before we present 
the main theorem, let us assume that the function g is one-to-one. Then for all x and 
cin J, if «Ac, then g(x) — g(c) 4 0. Thus, Theorems 5.1.2 and 4.3.9 imply that 


= lim A= = F'(a(e)). (5.3) 


Hence, we have that 


(fo g(t) = (Fogo) _ 


lim = lim 
=e (Seg) te (eed 
a (Late) = gto) im (a : a) 
= f(g(o)s'(0 by (53). 


Thus, when g is one-to-one, the above proof shows that (f 0 g)/(c) = f’(g(c))g'(c). 
If g is not one-to-one, then we may have x # c and g(x) — g(c) = 0. Thus, the 
above argument would not hold, because we cannot divide by 0. The following proof, 
however, shows that the chain rule holds even if g is not one-to-one. 


Theorem 5.1.5 (Chain Rule). Let J and J be intervals. Suppose that f: J > R 
and g: J > I, where g is differentiable at c € J and f is differentiable at g(c) € I. 
Then (f og): J > R is differentiable at c and 


(fog)'(e) = f’(9(c)) -9'(O)- 
Proof. Since g is differentiable at c, we have that 


han 20-90) 


wc zxZ—-C 


=9'(c) (5.4) 
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and because f is differentiable at g(c), we have that 


m 19) = Fale) 
yg(c) y — g(c) 


= f'(g(c)). (5.5) 
Consider the function h: J > R defined by 
fw)-fl9)) | 
h(y) = y—g(c)? ify A g(c); 
f(g), ify =g9(0). 


Using equation (5.5) and the definition of h, one can see that a h(y) = h(g(c)).4 
yale 


Lemma 4.3.3 now implies that h is continuous at g(c), and Theorem 5.1.2 implies 
that g is continuous at c. Hence, ho g is continuous at c by Theorem 4.1.13 and so, 


lim(ho g)(x) = h(g(e)) = F'(9(0). (50) 
ene f(a(2))-F(9(0)) 
yal? 9(2) # ge); 
hog\(n) <hlq(a)) a2 ses) 
(ho g)(x) = h(g(x)) ret Peale). 
Thus, for x € J and x £c, we have (see Exercise 5) 
(fog)(a) — (Fogo) _ (ho g(a) (C2= 99), (5.7) 
Hence, 
lim (fo ue) z u og)(c) _ (1im(h a 9)(x)) (1im g( ) z (0) 


by Theorem 4.3.7(2). Therefore, equations (5.6) and (5.4) now yield the equation 


im (f° 9) (2) — (Fe glo) 


w—->C zZ—-C 


= f'(g(0)) -9'(0). 


Exercises 5.1 
1. Let f: Ré — R* be defined by f(x) = Wz. Using Definition 5.1.1, find the 
derivative f’(c) when c > 0. 


2. Let f: R \ {0} > R be defined by f(x) = 4. Using Definition 5.1.1, find the 
derivative f’(c) when c 0. 


*3. Prove parts (1) and (2) of Theorem 5.1.4. 
4. Let f: R- Rand g: R — R be differentiable functions. Suppose that f(2) = 4, 
/ 
g(2) = 2, f"(2) =5, and g/(2) = —3. Find (fg)'(2), (£) (2), and (f © g)'(2). 


'The notation lim implies that y 4 g(c). 
yg(c) 
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*5. Show that (5.7), in the proof of Theorem 5.1.5, holds for all x € J when x #c. 


6. Let J be an interval. Suppose that f and g are differentiable functions on J. 


Suppose that f(x)g(x) = 1 for all x € I. Show that ey - + oa = = 0 forallze€ IJ. 


7. Let J be an interval and f be a differentiable function on J such that f(x) < f(y) 
whenever x < y are in J. Prove that f’(c) > 0 for all c € J. 


8. Let f: R > R be differentiable on R. 


(a) Prove that if f an even function, then f’ is an odd function. 


(b) Prove that if f an odd function, then f’ is an even function. 


9. Let f: R > R be defined by 


@ sins )a irae 0; 
0, if =0. 


(a) Using the chain and product rules, show that f is differentiable at any c £ 0. 
Find f’(c). 
(b) Using Definition 5.1.1, show that f is differentiable at c = 0. Find f’(0). 


(c) Show that f’ is not continuous at c = 0. 


10. Let f: R — R be defined by 


Evaluate lim _ ar) and lim f@)-10) Is f is differentiable at 0? If so, then 


r—0+ x07 


find f/(0). 


Exercise Notes: For Exercise 5, there are two cases to consider: g(x) = g(c) and 
g(x) # g(c). For Exercise 7, use Exercise 10 on page 117. For Exercise 9(b), see 
Exercise 22 on page 118. This exercise shows that the derivative of a differentiable 
function may not be continuous. 


5.2 THE MEAN VALUE THEOREM 


The Mean Value Theorem is one of the most important theorems in calculus, and it is 
a key tool in the proof of the Fundamental Theorem of Calculus (see Theorem 6.4.2). 
In this section, we establish the Mean Value Theorem and derive several of its conse- 
quences. We also identify and prove Cauchy’s Mean Value Theorem, a generalization 
of the Mean Value Theorem. 


Theorem 5.2.1. Let f be a differentiable function on the open interval (a,b). If f 
attains a maximum value or minimum value at a point c € (a,b), then f’(c) = 0 
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Proof. Let f be a differentiable function on the open interval (a,b). We shall assume 
that f attains a maximum value at c € (a,b). [The proof for the case that f attains a 
minimum value is similar.] Since f(c) is a maximum value, we have that f(x) < f(c) 
for all x € (a,b). We shall prove that f’(c) = 0. Let (a,) be a sequence of points in 
(a,b) that converges to c such that c < x, for all n € N. Theorem 4.3.4 implies that 


; f (Zn) =! f(o) / 
lin. : 
a ae (eo) 
Since f(t») < f(c) and c < a, for all n € N, it follows that fen)- HO < 0 for all 
n €N. Thus, Theorem 3.2.12 implies that f’(c) < 0. 
Also, let (yn) be a sequence of points in (a, b) that converges to c such that yn < ¢ 
for all n € N. Theorem 4.3.4 implies that 


Since f(y¥n) < f(c) and c > yp for all n € N, it follows that Bete > 0 for all 
n €N. Thus, f’(c) > 0. Since f’(c) > 0 and f’(c) < 0, we conclude that f’(c) = 0. 


As often occurs in mathematics, one first proves a special case of a more general 
theorem. Then the special case is used to prove the general theorem. The special case 
that we next prove is called Rolle’s Theorem, and this result will be used to prove 
the Mean Value Theorem. 


Theorem 5.2.2 (Rolle’s Theorem). Let f: [a,b] > R be a continuous function that 
is differentiable on the open interval (a,b). If f(a) = f(b), then there is a point 
c € (a,b) such that f’(c) = 0. 


I 
I 
I 
I 
| 
l 
a Cc b 


Figure 5.2: Illustration for Theorem 5.2.2. 


Proof. Let f: [a,b] + R be a continuous function that is differentiable on the open 
interval (a,b). Assume f(a) = f(b). If f were a constant function, then f’(c) = 0 for 
all c in [a, b]. So, assume that f is not a constant function. Let k = f(a) = f(b). As f 
is not a constant function, there is an x in (a,b) where f(x) # k. Since f: [a,b] + R 
is continuous, Theorem 4.4.3 implies that f attains a maximum value or minimum 
value at a point c € (a,b). Theorem 5.2.1 asserts that f’(c) = 0. 
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As portrayed in Figure 5.2, Rolle’s Theorem 5.2.2 implies that the graph of any 
differentiable function which has the same value at two distinct points must have a 
tangent line that is parallel to the z-axis. Using Theorem 5.2.2, we will now state 
and prove the Mean Value Theorem, 


Theorem 5.2.3 (Mean Value Theorem). Let f: [a,b] > R be a continuous function 
that is differentiable on the open interval (a,b). Then there is a point c € (a,b) such 
that. f'(¢)= 0) fa) 


Proof. Let f: [a,b] + R be a continuous function. Suppose that f is differentiable 
on the open interval (a,b). Define the function h: [a,b] > R by 


pT e—a) + F(a). (68) 


Note that h is a continuous on [a,b] and differentiable on (a,b). Moreover, h(a) = 0 
and h(b) = 0. Theorem 5.2.2 implies that there is a c € (a,b) such that h’(c) = 0. 


Equation (5.8) implies that h’(c) = f’(c) — FO) F(a) Ha) _ =). The, 7 (c= FO) Fa) Ha) 


f(b) 


secant line 


] 
I 
| 
| 
! 
\ 
I 
\ 
\ 
1 
\ 
I 
! 
b 


Figure 5.3: Illustration for Theorem 5.2.3. 


Figure 5.3 offers a geometric interpretation of Theorem 5.2.3. The value FO) F(a) 
is the slope of the secant line between the points (a, f(a)) and (b, f(b)). The theorem 
states that there is a c € (a,b) such that the slope of the associated tangent line, 
f’(c), is equal to the slope of this secant line. 

The Mean Value Theorem can be used to derive inequalities. For example, let 0 < 
a < b. We will prove that b? — a? < 3b?(b—a). Let f: [a,b] > R be defined by f(x) = 
x°. Then f is continuous on [a,b] and differentiable on (a,b). By Theorem 5.2.3, 
there is a c € (a,b) such that b? — a? = 3c?(b— a). Since c < b, we conclude that 
b3 — a? < 3b?(b—a). 

We now show how the Mean Value Theorem implies some familiar relationships 
that hold between a function and its derivative. 


Theorem 5.2.4. Let f: [a,b] — R be a continuous function. Suppose that f is 
differentiable on the open interval (a, 6). If f’(xz) = 0 for all x € (a,b), then f isa 
constant function on [a, )]. 


138 m Real Analysis: With Proof Strategies 


Proof. Let f: [a,b] + R be a continuous function. Suppose that f is differentiable 
on the open interval (a,b). Assume that 


f'(x) = 0 for all x € (a,b). (5.9) 


Suppose, for a contradiction, that f is not constant of [a,b]. Thus, there are y, z in 


[a, 6] such that y < z and f(y) 4 f(z). So, 
f(z) = FY) 
Y 


Z- 


40. (5.10) 


Consider the closed interval |y, z]. Since f is continuous on [y, z] and differentiable on 
(y, z), there is ac € (y, z) such that f’(c) = fa)afy! by Theorem 5.2.3. Hence, (5.10) 
implies that f’(c) 4 0. However, (5.9) implies that f’(c) = 0, a contradiction. 


Corollary 5.2.5. Let f: [a,b] > Rand g: [a,b] — R be a continuous functions that 
are differentiable on the open interval (a,b). If f’(2) = g'(a) for all x € (a,b), then 
there is a constant C' € R such that f(x) = g(x) + C for all x € [a, J. 


Proof. Let f: [a,b] > R and g: [a,b] > R be a continuous functions. Suppose that 
f and g are differentiable on the open interval (a,b). Assume that 


f'(x) = g'(z) for all x € (a, 0). (5.11) 
Let h: [a,b] + R be defined by h(x) = f(x) — g(x). Since h'(x) = f’(x) —g'(z), (5.11) 


implies that h’(x) = 0 for all x € [a,b]. Thus, by Theorem 5.2.4, there is a constant 
C such that h(x) = C for all x € [a,b]. Hence, f(x) = g(x) + C for all x € [a, O]. 


In the proof of Theorem 5.2.8 below, we use the Mean Value Theorem to establish 
a connection between the derivative of a function and the function’s behavior. First, 
we need to identify these particular behaviors. Recall that “iff” is an abbreviation for 
“if and only if” 


Definition 5.2.6. Let f: J > R be a function where J is an interval. 
e f is strictly increasing on J iff for all y,z € I, if y < z, then f(y) < f(z). 
e f is increasing on J iff for all y,z € J, if y < z, then f(y) < f(z). 
e f is strictly decreasing on J iff for all y,z € J, if y < z, then f(y) > f(z). 
e f is decreasing on J iff for all y,z € I, if y < z, then f(y) > f(z). 


Theorem 5.2.7. Let J be an interval. If f: J > R is strictly increasing or strictly 
decreasing, then f is one-to-one. 


Theorem 5.2.8. Let f: J — R be a function where J is an interval. Suppose that 
f is differentiable on J. Then the following hold: 


(a) If f’(~) > 0 for all x € J, then f is strictly increasing on I. 
(b) If f’(x) < 0 for all x € J, then f is strictly decreasing on J. 
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Proof. Let f: I + R be differentiable on an interval J. We prove (a) and leave (b) as 
an Exercise 12 (the proof is very similar). Assume that (a) f/(x) > 0 for all x € J. 
Let y, z € I be such that y < z. We must show that f(y) < f(z). By Corollary 5.1.3 
and the Mean Value Theorem 5.2.3, there is a c € (y, z) such that 


f(z) — fy) = fe - 9). 


By (4) we have that f’(c) > 0. Since z — y > 0, we conclude that f(y) < f(z). 


We now extend the Mean Value Theorem to one that involves two functions f 
and g. The proof of this extension relies on the introduction of a new function h. 


Theorem 5.2.9 (Cauchy Mean Value Theorem). Let f: [a,b] + Rand g: [a,b] > R 
be continuous on [a, 6] and differentiable on (a,b). Then there is c € (a,b) such that 


[F(b) — F(a)]g’(©) = [9(b) — gla) f'(e)- (5.12) 


= (5.13) 


Proof. Let f: [a,b] > R and g: [a,b] > R be as stated. Define h: [a,b] — R by 
h(a) = (f(b) — F(a))g(@) — (9(®) — g(@)) (2). 


Hence, h is a continuous on [a,b] and differentiable on (a,b). In addition, h(a) = 0 
and h(b) = 0. By Theorem 5.2.2, there is a c € (a,b) such that h’(c) = 0. Thus, 


which implies (5.12). Finally, if g’(x) 4 0 for all x € (a,b), then Theorem 5.2.2 implies 
that g(b) 4 g(a). Therefore (5.12) implies (5.13). 


5.2.1 LHépital’s Rule 


As you may recall from calculus, the “indeterminate forms” ° and & can result from 


attempting to evaluate the limit of a ratio of functions. L’H6pital’s Rule offers a 
method for evaluating such limits (if they exist). We will prove, independently, four 
versions of L’H6pital’s Rule using Theorem 5.2.9 and Theorem 4.3.4. The first two 
rules address the indeterminate form 2, while last two rules deal with the form 2. 
L’H6pital’s rules are fairly easy to apply; however, their proofs are fairly involved. 


Theorem 5.2.10 (L’H6pital’s Rule I). Let f and g be continuous on an interval J, 
and differentiable on I except possibly at c € I. Suppose that f(c) = g(c) = 0 and 


f so f(a) _ stag fe) 
g(x) £0 for all x € I \ {c}. If lim aay Land LER, then im Ca L. 
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Proof. Let f and g be continuous on J and differentiable on J \ {c} where c € J and 
T is an interval. Suppose that 


ie) =9(¢) <0. (5.14) 
Assume that g'(x) 4 0 for all x € I \ {c} and L € R is such that 
EAD) og (5.15) 


ete g!(=) 
Let (s,) be a sequence of points in J \ {c} such that lim Sn = c. Theorem 5.2.9 
n [ee) 
implies that for each n € N, there is a point c, strictly between s, and c such that 


f= F(sn) _ fen) 


g(e) —alsn) — Fen) a 
Thus, equations (5.14) and (5.16) imply that 
sn) _ Fler) sn all nN. (5.17) 
Sn) 9’ (Cn) 


Since lim s, = c and each c, is between s, and c, we have that Jim, Cn = c by 
noo 


Corollary 3.1.13. Theorem 4.3.4 and equation (5.15) imply that tim a 7 Lie) = = L. 


From (5.17) we infer that jim n A} = = L. Hence, lim , AS = L by riieren 4.3.4, 


For each limit lim in Theorem 5.2.10, it is understood that x is in I \ {c}. We 


now present our second version of L’H6pital’s Rule, which involves a limit at infinity. 


Theorem 5.2.11 (L’H6pital’s Rule II). Let f and g be differentiable on (a, 00) such 
that lim f(2) = lim g(x) =O and g/(x) £ 0 for all x € (a, ov). If lim L@ —_ DT and 


g(a) 
LER, then lim fe) — 7, 
roo 9(2) 


Proof. Let f and g be differentiable on (a, oo) and satisfy the above conditions. Since 
g(x) # 0 for all x € (a,00), Rolle’s Theorem implies that there is a Dy > a such 
that g(x) 4 0 for all « > D,. Assume that jim n OS = L. Let (s,) be a sequence of 
distinct points in (a, oo) such that Jim, 8n = 00. Since lim. fe) = jim, Ga) =O eit 


follows that lim TS) eS lim g(Sn) = 0 by Exercise 19 on page 118. We now prove 


that lim “6 = LZ. Let ¢ > 0. Since lim F(z) = [, there is a Dz > D, such that 
noo 9(Sn) xr oo 9'(x) 


— L ie 5 for all x € (a, 00) such that x > Do. (5.18) 


Since lim Sn = 00, there is an N, € N such that s, > Dg, for all n > Ny. Let 
nr CO 


m > N,. For each n > m, there is a point c, between s,, and s,, such that 


f (8m) — f (Sn) =, f'(en) 


5m) — Gn) ¥en)’ (5.19) 
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by Theorem 5.2.9. Since c, > D2 for all n > Nj, (5.18) and (5.19) imply that 


f (8m) — f($n) 
9(Sm) — g(Sn) 


I] < : for alln > Nj. (5.20) 


As Jim (Sq) = Jim g(Sn) = 0, it follows that 


Met G\S: = g(Sn) 9(Sm) 
Therefore, Fou 7 -1 < § by (5.20) and Corollary 3.2.16. Thus, Few - L| < ¢ for 


allm > N,. Hence, im Hoa = [. So jim n A = = L by Exercise 19 on page 118. 


The following version of L’H6pital’s Rule involves a infinite limit which is usually 
identified as having the form 3; however, the proof only requires that the limit of 
the denominator be infinite. 


Theorem 5.2.12 (L’H6pital’s Rule III). Let f and g be differentiable on J \ {c}, 
where J is an interval and cis an interior or end point of J. Suppose that lim 1 g(a ) = 00 


and g'(x) £0 for all x € J \ {c}. If lim Fi} = Land L ER, then lim , He) = 1. 


Proof. Let f, g, c, and I be as stated. Let c be an interior point and assume that 
lim 15 " = L. Let (s,) be a sequence of distinct points in J that converges to c 


qilete Sn > c for all n EN. Since lim g(x) = oo and lim Sn = ¢, it follows that 


lim g(s,) = oo by Exercise 17 on page 118. We now prove that lim ea = L. Let 


noo 


e > 0. Since lim 1 Fe } = L, there is a 6 > 0 such that 


Since lim g(sp,) = oo and lim s, =, there is an N; € N such that g(s,) > 0 and 
n—0o Noo 


< = for all x € I such that 0 < |x —c| < 6. (5.21) 


|S, —c| <0, for all n > Nj. For each n > Nj, there is a point c, between s, and sy, 


such that fGy—iou). 7) 
Sn) — SN, _ Cr 
esa Gay (5.22) 


by Theorem 5.2.9. Since 0 < |e, — c| < 6 for all n > My, (5.21) and (5.22) imply that 


OiOn 
[citea) 


< 5 for all n > Nj. (5.23) 
As lim g(Sn) = 00, let No € N be such that Np > N; and 


9(sn) — 9(sm,) _ , _ 9(sm.) 


weer co a(n) 


<1 foralln > No. 
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Thus, from (5.23), we have 


g(sn) — g(smi) pee — f (sn) 
G(Sn) — 


—g € € 
Il) < < f lln > No. 
G(8n) GSN; ) | QQ Are ae 


So, by algebra, 


= for alln > No. 


Hence, by more algebra, 


(Le) 1) - (Eee g09 <5 eat a 


Theorem 2.2.7(iii), the backward triangle inequality, thus implies that 


Fey <a ta 


for alln > No. (5.24) 


Since lim g(Sn) = 00, there is an N3 € N such that N3 > No and 


—L 

ee gsm) <= for alln> N3. (5.25) 

9(Sn) 2 
Thus, (5.25) and (5.24) imply that Een} - L| < €, for all n > N3. We conclude that 
lim £S) — [. Therefore, lim [% = (see Remark 4.3.11). A similar argument 

n300 9(Sn) act g(x) 

shows that lim {2 = L. Therefore, lim ff) _ fp, by Theorem 4.3.12. The above 

rt c7 g(x) roe 9() 


argument shows that the theorem also holds when c is an endpoint of J. 


Remark 5.2.13. We note that if lim g(x) = —oo, then lim —g(x) = oo. This fact 
can be used to show that Theorem 5.2.12 implies that the theorem also holds if 
condition lim g(x) = 00 is replaced with lim g(x) = —oo. 


Theorem 5.2.12 and it’s proof do not require that lim f(z) = co, that is, the 


numerator is not required to have an infinite limit. We take advantage of this fact in 
our next example. 


Example 5.2.14. Let e” be the natural exponential function. Let h: R > R be 
defined by 
en l/x? : ; 
h(x) = a Nhe Us 
0, ifr =0. 
We shall show that lim, h(x) = 0. Note that 
2 
—1/x? 1 
ge ae (5.26) 


x el/x? b) 
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whenever x # 0. Letting f(z) = +, g(x) = el/ T = R, and c = 0, we see that the 


conditions of Theorem 5.2.12 are satisfied. Since 
eC a eee 7 
io Oe) — _ el/ax? — — in Jel/e? — ~? 


Theorem 5.2.12 implies that lim 48 = 44% = 0. Hence, by (5.26), lim h(x) = 0. 
2-0 g(x) el/e x20 


The proof of our final version of L’Hépital’s Rule is similar to the above proof of 
Theorem 5.2.12 (see Exercise 18). 
Theorem 5.2.15 (L’Hopital’s Rule IV). Let f and g be differentiable on (a, 00). 
Suppose that lim g(x) = co and g(x) £0 for all x € (a, oo). If lim ne = LI and 
LER, then lim fle) — 


co 9(2) 


5.2.2 The Intermediate Value Theorem for Derivatives 


By Theorem 4.4.6, continuous functions satisfy the intermediate value property. In 
this section we will prove that the derivative of a function also has this property, even 
if the derivative is not continuous. First we show that the derivative of a differentiable 
function, satisfies an “intermediate root property.” 


Lemma 5.2.16 (Intermediate Root Lemma). Let f: J + R be differentiable on J, 
an interval. Let a € I and b € I be such that a < b. Suppose that f’(a) <0 < f’(0). 
Then there is an 7 € (a,b) such that f’(i) = 0. 


Proof. Assume that f’(a) <0 < f’(b). Thus, 


Pee ksi 


“za r—-a x—>b x—b 


By Exercise 11 on page 118, there are c € I and d € I such that (A) a<c<d<b 


and 
HO-F@) 4 FM-F0, 
c—a d—b 
Hence, (5.27) and (A) imply that (v) f(c) < f(a) and f(d) < f(b). Since f is 
continuous, Theorem 5.2.1 implies that f attains a minimum value at a point 7 € [a,b], 
and (¥), (A) imply that i € (a,b). Theorem 5.2.1 now implies that f’(i) = 0. 


(5.27) 


The next result is due to Jean Gaston Darboux. It shows that any function that is 
the derivative of a function has the intermediate value property (see Definition 4.4.5). 


Theorem 5.2.17 (Intermediate Value Theorem). If f: J > R is differentiable on 
an interval J, then f’: J —+ R has the intermediate value property on I. 


Proof. Assume that f’(a) < k < f’(b) where a < b and a,b € I. Let g: I > R be 
defined by g(x) = f(x) —k. Clearly, g differentiable on J. Note that g(a) < 0 < g/(b). 
Lemma 5.2.16 implies that there is an 7 € (a,b) such that g/(i) = f’(«) — k = 0. So 
f'(i) =k. If f(a) > k > f(/b), then a similar argument applies using the function 
g(a) =k f(a). 
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5.2.3 Inverse Function Theorems 


There are two results about inverse functions that we will address in this section. We 
first prove that the inverse of a continuous function is continuous, and then prove that 
the inverse of a differentiable function is differentiable. The first result will be used 
to prove the latter. We begin by identifying a shorter way of saying that a function 
is “either increasing or decreasing.” 


Definition 5.2.18. Let f: J > R bea function where J is an interval. We say that f 
is monotone on J if f is either increasing or decreasing on I. Moreover, if f is either 
strictly increasing or strictly decreasing, then we say that f is strictly monotone. 


The Inverse of a Continuous Function 


Let f be a continuous injection defined on an interval. So the inverse function f~! 
exists; but, is it continuous? Since f is a continuous injection, f must be strictly 
monotone (see Theorem 4.4.6 and Exercise 23). The next lemma will be used to show 
that, in fact, f~! is continuous. 


Lemma 5.2.19. Let g: J — R be strictly monotone on the interval J. If g[/] is an 
interval, then g is continuous. 


Proof. Let g: I > R be strictly increasing on the interval J and assume that g[J] is 
an interval. [The proof is similar when g is strictly decreasing.] To prove that g is 
continuous, let c € J and ¢ > 0. There are three cases to consider. 
CASE 1: g(c) is an interior point of the interval g[]. Let «* > 0 be such that e* < 
and 

[g(c) — €, g(c) + €*] C gl]. 
Let a,b € J such that (a) g(a) = g(c) — &* and g(b) = g(c) + &*. Since g is strictly 
increasing and g(a) < g(c) < g(b), it follows that a < c < 6 (see Exercise 6). Let 
6 = min{|a —c|,|b—c|}. Let  € I be such that |x — c| < 6. Hence, a < x < b. Thus, 
as g is strictly increasing, g(a) < g(x) < g(b). By (a), we have that 


g(c) —&* < g(x) < g(c) +. 


Therefore, |g(x) — g(c)| < e* <e. 
CASE 2: g(c) is the left endpoint of g[J]. Since g is strictly increasing, c must be the 
left endpoint of [. The proof of Case 1 adapts to prove that g is continuous at c. 

CASE 3: g(c) is the right endpoint of g[J]. As g is strictly increasing, c must be the 
right endpoint of J. The proof of Case 1 adapts to prove that g is continuous at c. 


Theorem 5.2.20. Let f: / — R be strictly monotone and continuous on J, an 
interval. Then J = f[J] is an interval, f is one-to-one, and f~!: J — I is strictly 
monotone and continuous on J. 


Proof. Let f: I > R be strictly monotone and continuous on an interval J, and let 
J = f[J]. Theorem 5.2.7 implies that f is one-to-one, and Theorem 4.4.7 implies that 
J is an interval. Thus, f~': J > IJ is strictly monotone, by Exercise 9. As f~+[J] = I 
where J and J are intervals, Lemma 5.2.19 implies that f~! is continuous. 
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For an interval J, let J* denote the set of points in J that are not endpoints of I. 


Corollary 5.2.21. Let f: J — R be continuous on the interval J. Suppose that 
f'(x) 4 0 for all x € I*. Then J = f[J] is an interval, f is one-to-one, and f~!: J > I 
is strictly monotone and continuous on J. 


Proof. Let f: I — R be continuous on the interval J where f’(x) 4 0 for all x € I*. 
Theorem 5.2.17 implies that either f’(a) > 0 for all x € I*, or f’(x) < 0 for all 
x € I*. Hence, by Theorem 5.2.8, f is strictly monotone on J*. The Mean Value 
Theorem 5.2.3 now implies that f is strictly monotone on J. Theorem 5.2.20 thus 
implies that f is one-to-one, f[J] is an interval, and f~1 is continuous on f{[J]. 


The Inverse of a Differentiable Function 


We can now establish a relationship between the derivative of a strictly monotone 
function and the derivative of its inverse. 


Theorem 5.2.22. Let f: J — R be differentiable on the interval J. Suppose that 
f'(x) £0 for all x € I. Then f is one-to-one, f [J] is an interval, and f~!: f[I] — I 
is differentiable on f[J]. Moreover, (f~')'(y) = FO for ally € f[J]. 
Proof. Let f: I + R be such that that f’(2) 4 0 for all x € J. By Corollary 5.2.21, 
f is one-to-one, f[J] is an interval, and f~! is continuous on f [J]. 
To show that f~! is differentiable on f[J], let y € f[J]. Now let c € I be such 
that (1) f(c) = y. Hence, (2) c= f~1(y). We will show that 
-1 _ f-l 1 
a ee ee 
zy ay fF) 
by applying Theorem 4.3.4. Let (yn) be a sequence of points in f[J] \ {y} such that 
lim Yn = y. Thus, there is a sequence of points (c,) in I \ {c} such that f(cn) = Yn 


for alln € N. So (3) Cn = f~'(yn) for all n € N. Since f~! is continuous, we see that 
(A) im, f-'(un) = f-+(y). So (a), (2), and (3) imply that jim cn = ¢. Because f 


is differentiable, Theorem 4.3.4 implies that (¥) lim Len)“ F0) = f'(c). Therefore, 


_ fF 'oyn)-f tw _, Cn — C 
lim = lim ————__ by (1), (2), and (3) 
n—-y00 Yn —Y noo f(Cn) — fle) 
1 
= Lh FeO, Pysalgchre 
1 1 


= Fo = FU) by Theorem 3.2.6, (¥), and (2). 


Thus, f—' is differentiable on f[J], and (f~1)'(y) = FO) for all y € f{J]. 


The next theorem, which follows from the proof of Theorem 5.2.22, focuses on a 
single point in the domain of an inverse function. 
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Theorem 5.2.23. Let f: J — f{J] be strictly monotone and continuous on the 
interval J. Then f[J] is an interval and f~': f[{J] + I is continuous. Let d € f[J] and 
c € I be such that f(c) = d. If f’(c) £0, then f~': f[I] > I is differentiable at d 
and (f~*)'(d) = 77q=7—y 


LFA (@))* 


Exercises 5.2 


1. 
2. 


*6. 


*9. 


10. 


11. 


*12. 
13. 


14. 


15. 


Prove Theorem 5.2.7. 

Let f: [0,2] — R be continuous on [0,2] and differentiable on (0,2). Suppose 
that f(0) =0 and f(1) = f(2) =1. 

(a) Show that there is a c € (0,1) such that f’(c) = 1. 


(b) Show that there is a c € (1,2) such that f’(c) = 0. 
i 


. Let f(x) = = and let |a, b] be an interval where a > 0. Find the point c guaranteed 


x 
by the Mean Value Theorem. Since a < c < b, derive an interesting inequality. 


. Prove that the equation x7 + 2° + 23 +2 +1=0 has exactly one real solution. 


Hint: For uniqueness, use Rolle’s Theorem. 


. Let f: (0,1] > R be differentiable on (0, 1] such that | f’(2)| <1 for all x € (0, 1]. 


i 
n 


Define a sequence (t,) by t, = f(=) for all n > 1. Prove that (t,) converges. 


Let f: J — R be a function where J is an interval. Let f be strictly increasing 
on I and let y,z € I. Suppose that f(y) < f(z). Prove that y < z. 


. Let f: [0,0co) + R be continuous on [0, 00) and differentiable on (0,00). Suppose 


f (0) = Oand that f’ is increasing on (0, 00). Define g: (0,00) > R by g(x) = fe) 
Prove that g is increasing on (0, 00). 


. Let f: [> R be differentiable on the interval J. Prove the following: 


(a) If f is increasing on J, then f’(x) > 0 for all x € I. 
(b) If f is decreasing on J, then f’(x) < 0 for all x € I. 


Let f: [ — R be strictly monotone on the interval J. Theorem 5.2.7 implies 
that f is one-to-one. Now suppose that f[/] is an interval. Prove that the inverse 
function f~!: f[I] > I is strictly monotone. 


Complete the proofs of Case 2 and Case 3 in the proof of Lemma 5.2.19. 


Let f: I + R be differentiable on an interval J. Suppose that f’ is not constant. 
Show that f’ takes on some irrational values. 


Prove Theorem 5.2.8(b). 


Provide an example of a function that satisfies the conditions of Theorem 5.2.8 
and shows that the converse of Theorem 5.2.8(a) does not hold. 


Suppose that f: R — R and g: R — R are both differentiable and strictly 
increasing on R. Prove that (f og): R > R is also strictly increasing on R. 


Let i € I, where J is an interval. Suppose that G: J > R and f: I > R are 
continuous. Prove that if G’(x) = f(x) for all x in I \ {7}, then G’(i) = f(i). 
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1 Csi 
ne) lim @ and lim < an 
# xz—0+ - rot “Me 


*17. For every integer n > 0, define h,: R > R by 


ee, ; 
meal ie > Hatt 
0, ifr2=0. 


16. Evaluate lim 
x1 


It follows that lim, ho(a) = 0, and Example 5.2.14 shows that lim, hile) =, 
=> = 
Prove, by an induction, that lim, hole) =0 forall n> 1. 
ara 


*18. Prove Theorem 5.2.15. (See Exercise 19 on page 118.) 


19. Let f: R— R is differentiable on R. Let M > 0 be such that | f’(a)| < M for all 
x ER. Prove that f is uniformly continuous. 


20. Let f and g be differentiable on (—oco, b). Suppose that 
(1) lim g() = 00, 
(2) g(x) £0 for all x € (—co, d)). 
Show that Theorem 5.2.15 implies that if jim f ey = Land L € R, then 
lim oe = 15, 


t—oo Je 

21. Let f: J > R be continuous on the interval J. Let 7 € I. Suppose that f’(x) > 0 
for alla € I\{j} and f’(j) = 0. Prove that f is strictly increasing on J. Conclude 
that f~!: f[Z] > I exists and is continuous. 


22. Let f: J — R be one-to-one function with the intermediate value property on 
an interval J. Let x,y,z € I be such that « < y < z. Suppose that either 


f(x) < fy), FY) < F(z), or f(x) < f(z). Prove that f(x) < fly) < f(z). 


*23. Suppose that f: J — R is one-to-one and has the intermediate value property 
on an interval J. Using Exercise 22, prove that f is strictly monotone. 


Exercise Notes: For Exercise 5, use Theorem 5.2.3 and Exercise 4 on page 90 to 
show the sequence is a Cauchy sequence. For Exercise 7, prove that g/(x) > 0 for all 
x € (0,00). First evaluate g/(x) by using the quotient rule and then show that there 
is ac such that 0<c< wand f(x) = f’(c)z. For Exercise 8, use Exercise 10 on page 
117. For Exercise 15, use Theorem 4.3.4 and Theorem 5.2.3. If 7 is an endpoint of J, 
then the derivative G’(7) is a one-sided limit of the difference quotient. For Exercise 17, 
use Example 5.2.14 as a model. For Exercise 18, modify the proof of Theorem 5.2.12. 
For Exercise 20, see Exercise 24 on page 118. For Exercise 23, assume that f is not 
strictly decreasing, and then prove that f is strictly increasing. 


5.3. TAYLOR’S THEOREM 


We next discuss another generalization of the Mean Value Theorem which involves 
higher order derivatives. This generalization can allow one to approximate certain 
functions by polynomial functions. 
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Higher Order Derivatives 


Let f: I > R be differentiable on an interval J. Then, for all c € I, we have that 


Hj OO. 


rc L—C 


Thus, the derivative of f is itself a function from J to R, that is, f’: J > R. Ifce I 
and f’ is differentiable at c, then the derivative of f’ at cis denoted by f”(c); that is, 


$0) = tim POLLO. 


wc Ge =E 


The value f”(c) is called the second derivative of f at c. In this case, f is said to 
be twice differentiable at c. The value f’(c) is also denoted by f®)(c). In many 
cases, this process can be repeated. We will say that a function f: J > R is n-times 
differentiable on J, if f can be differentiated n-times at each point c € J, where 
n > 1 is a natural number. In this case, we denote the nth derivative of f at c by 
f™(c). Moreover, we let f = f. 


A Mean Value Theorem for Higher Derivatives 


Let I be an open interval and let f: J > R be differentiable on J. The Mean Value 


Theorem 5.2.3 implies that for any two distinct points a € I and b € J, there exists 


a point c between a and b such that f’(c) = FO) F(a) | that is, 


f(b) = fla) + f'((— a). 
We now present a Mean Value Theorem for the second derivative of a function. 


Theorem 5.3.1. Let f: J — R be twice differentiable on an open interval J. Then 
for any two points a € I and b € J, there is a point c between a and 6 such that 


(0) = F(a) + F(a)(0— a) + F(a. (5.28) 


Proof. If a = 6, then (5.28) holds for c = a. So, assume that a 4 b. Let M be the 
unique real number that satisfies the equation 


f(b) = f(a) + f'(a)(b — a) + M(b— a)’. (5.29) 
Define F': I > R by 
Fi) = fo+f/@OO-)+Mo-#". (5.30) 


It follows that F’ is differentiable on J. So, by Corollary 5.1.3, F is continuous on J. 
In particular, F is differentiable on the open interval with endpoints a and b, and F 
is continuous on the closed interval with endpoints a and b. Note that 


F(a) = f(a) + f’(a)(b— a) + M(b—a)?. 
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Thus, by (5.29), F(a) = f(b). Moreover, by equation (5.30), we see that F'(b) = f (0). 
Hence, F(a) = F(b). Theorem 5.2.2 implies that there is a c between a and b such 
that (a) F’(c) = 0. Equation (5.30) implies that 


F'(o) = file) + f"()(6— 0) — file) —2M(b— c) = f"(c)(b— c) — 2M(b — c). 
Hence, from (4), we conclude that F’(c) = f”(c)(b—c) —2M(b—c) = 0. Solving the 
latter equation for WM, we obtain M = a Therefore, equation (5.29) implies that 


f"(o) 
2 


f(b) = fa + f'(@b—a) + (b— a)”. 


The proof of Theorem 5.3.1 can be extended to prove the following theorem (see 
Exercise 4). 


Theorem 5.3.2 (Taylor’s Theorem). Let f: J + R be (n + 1)-times differentiable 
on an open interval J. For any two points a € J and b € J, there is a point c between 
a and b such that 


f"(@) 


f(b) = fa) + F(ab - a) +--+ 
f(a) 1 £6) nae (5.31) 


Equation (5.31), in Theorem 5.3.2, can be written as f(b) = >> LPO (p —a)*, 
k=O 


Taylor’s Theorem can be used to approximate f near a point a by a polynomial 
function. Let f: J + R be (n+1)-times differentiable on an open interval J containing 
a, and for all x € TJ let 


mn FR (q 
Pie S- pit My —a)* and Ry(x) = f(x) — Pa(z). 


fy. oh! 


P,, is called the nth Taylor polynomial of f centered at a, and R, is said to be the 
remainder term, which measures the difference between f and P,. For any b € J, 
Theorem 5.3.2 states that there is a point c between a and b such that 


foro 


Ra(b) = “Gay 


(b = ayer. 
This result offers another way to measure R,,(b) = f(b)—P,,(b), the difference between 
f(b) and P,,(0). 


Example. To illustrate the usefulness of Taylor’s theorem in approximations, let 
f(x) = e® for all x € R. Recall from calculus, that f(a) = e® for all n € N. We 
are interested in approximating e” near a = 0. Since f(0) = 1 and f(0) = 1 for all 
n EN, we see that nth Taylor polynomial of f centered at 0 is 

Lae (ee gg? ae” 


e=lt+r+—+4+24---4 


P(x) = kl a 3 


k=0 
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Let n = 5 and let b € [—1, 1]. Since R5(b) = £46 = <b° for some c between 0 and 
b, we conclude that 


R;(b) = 18 = — < .0038. 


6! ~ 6! 6! 
a + a + v e differs from e? by less than .0038. 


ia 
nN 


So, for b € [—1, 1], P5(b) =1+b 


Exercises 5.3 


1. Let f: R — R be twice differentiable on R and let c € R. Show that 
2c—x)—2 
te) = tim FO)+ fe— 2) - 24(0) 


rc (x _— ce)? 


2. Prove Theorem 5.3.2 for n = 2. 


3. Prove the following by mathematical induction on n > 1: Let f: J > R be 
(n + 1)-times differentiable on an open interval J. Define F,,: I > R by 


Fo) =f + P@e—)4 Me—ore..4 LOG or (5.32) 


2! n! 
Then Fi(t) = Oo — 2)”. [Hint: Fasi(t) = a(t) + FOO - ory 


*4. Using the result of Exercise 3, prove Theorem 5.3.2. 


5. Let f: I > R be (n+1)-times differentiable on an open interval J. Suppose that 
f@*)(z) = 0 for every z € I. Let a € J and b € I. Show that 


” (n) 
£0) = F(a) + FO a) + Fa)? +--+ FO ayr 
6. Let f: J > R be (n + 1)-times differentiable on an open interval J. Let M > 0 


and d > 0. Suppose that Fical) < M for every cE I. Leta € I andb€ I be 
such that |b — a| < d. Show that 


nek) (g 
hw Oo —ayh 


Mad} 


— j 
~ (n+1)! 


7. Let f: [k,@] — R be twice differentiable on the open interval (k, 2). Suppose that 
f and f’ are continuous on [k, 4]. Let a € [k, 4] and b € [k, 4]. Show that the proof 
of Theorem 5.3.1 can be slightly modified to demonstrate that there is a point c 
between a and 6 such that 
7 / ma) 2 
f(b) = fla) + fi(a)(— a) + ——(b— a)". 
8. Let f(x) = sin(x). Find Py(x), the 4th Taylor polynomial of f centered at 0. For 
b € [1,1], estimate how close is P,(b) to sin(b)? 
Exercise Notes: For Exercise 1, see Theorem 5.2.10. For Exercise 4, note that 
F\(t) = f(t) + f’(£)(b — t). Modify the proof of Theorem 5.3.1 (see equations (5.32) 
and (5.30)). 


CHAPTER 6 


Riemann Integration 


One of the oldest problems in mathematics is called the area problem: 


How to find the area of a plane figure whose boundary has one or more 
curved edges? 


Archimedes (287-212 BC) developed an ingenious method for finding the area of a 
circle and a few other figures with curved edges. Unfortunately, Archimedes method 
could not be widely applied. The general problem of evaluating the area of a region 
with a curved boundary plagued mathematicians for nearly two thousand years. Then, 
in the late 17th century, Isaac Newton and Gottfried Leibniz independently developed 
a method that can be used to solve the area problem for a wide variety of regions 
bounded by curves. 

Let us consider one version of the area problem: Let f: [a,b] > R be continuous 
and positive. Let R be the region under the curved graph of f and above the interval 
[a, 6] on the z-axis. Newton and Leibniz showed that one can evaluate the area of 
R, by finding an antiderivative of f. This discovery is generally identified as the be- 
ginning of calculus and allowed mathematicians to solve a score of applied problems. 
Such success may have caused mathematicians to overlook an important foundational 
question: What does the “area of R” really mean? Eventually, in the middle of the 
19th century, Bernhard Riemann gave a mathematical precise definition of the area 
under a function, now called the Riemann integral. 

In this chapter we present a rigorous development of the theory of Riemann inte- 
gration. The reader should be familiar with the concept of integration from calculus. 
However, in this chapter, we focus on the proofs of many of the theorems that are 
applied in a calculus course. We follow a development due to Jean Gaston Darboux, 
which, when compared to Riemann’s original idea, is easier to understand. 


6.1. THE RIEMANN INTEGRAL 


The definition of the Riemann integral is technically more involved than many of 
the other concepts discussed in this book. Before presenting this definition, we first 
develop the necessary terminology, notation, and preliminary lemmas. This is done 
in the next two sections. 
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6.1.1. Partitions and Darboux Sums 


The notion of a partition of a closed interval is needed in order to define the Riemann 
integral. A partition allows one to split an interval into a finite number of subintervals. 


Definition 6.1.1 (Partition of an Interval). Let [a,b] be an interval. 


e A partition P of [a, }] is a finite set P = {%9,71,%2,...,%,} of points such that 
A=% <4 <%2<+++ <a =b. 


e If P and P* are partitions of [a,b] and P C P*, then P* is a refinement of P. 


The points in a partition P are always listed in strictly increasing order where the 
first point is the left endpoint of the interval and the last point is the right endpoint 
of the interval. A refinement of a partition P is just a partition that is obtained by 
adding more points, from the interval, to P. 


Remark. Given two partitions P and Q of [a, }]. By listing the points of PUQ in 
increasing order, we see that PU Q is also a partition of [a,b]. Thus, PU Q is a 
refinement of both P and Q. 


Example. Let a = 0 and 6 = 2. Then P = {0 78 5, 2} and Q = {0,3 a $,1, 2} are 
two partitions of [0,2]. So, PUQ = {0, 4 = $1, 2} is also a partition of [0,2] and is 
a refinement of both P and Q. 

Definition 6.1.2 (Summation Notation). Let (a1, a2,...@n) be a finite sequence of 


real numbers. Then s a; = a, +a2+---+a,. Whenever 1<k <n, S° a; represents 
itk 
the sum of that a;’s alee i # k. Moreover, if A C {1,2,...,n} is nonempty, then 


>> a; denotes the sum of the a;’s where i € A. If A= @, then YS a; = 0. 
icA icA 


Let f: [a,b] — R be a positive bounded function. The next definition identifies 
Darboux idea on how to approximate the area between the graph of f and the z-axis. 
This is done by summing the area of rectangles that are circumscribed above that 
graph of f (see Figure 6.1) and by summing the area of rectangles that are inscribed 
below that graph of f (see Figure 6.2). 


Definition 6.1.3 (Darboux Sums). Let f: [a,b] > R be bounded. Given a partition 
P49, 21a say ty} OF log), for each 4 — 1262. let 


Mi(f) = sup{f(z) : 2 € [x:-1, xi] }, 
im fF) = inl fe) eS | as I}, 
Aa; = Li G-1- 
When there is no ambiguity, we shall write M@; = M;(f) and m; = m;(f). In addition, 
we define the upper sum of f with respect to P by U(f, P) = - M;Ax; and define 
i=l 


the lower sum of f with respect to P by L(f, P) = 3 mAx;. 
i=l 
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Let f: [a,b] + R be a positive bounded function and let P be a partition as 
in Definition 6.1.3. To obtain the upper sum U(f, P), for each i = 1,2,...,n one 
obtains a rectangle with base Az;, height M;, and area M;Az;. Then U(f, P) is the 
sum of these rectangular areas. Similarly, each lower sum L(f, P) is equal to the sum 
of the areas of rectangles with base Az; and height m;. Since m; < M; for each 
i=1,2,...,n, we see that L(f, P) < U(f, P). 


Remark 6.1.4. In Definition 6.1.3, letting f([zi-1, xi]) = { f(x) : © © [ai-1, xi] }, we 
can write M;(f) = sup[f([xi-1, v])] and mi(f) = inf[f([2s-1, x;])). 


Figure 6.2: Lower sum of f with respect to P equals L( f, P) = 23.9974. 


Example. Define f: [—3,3] > R by f(x) = x? —sin(3z) + 2. Consider the partition 
P = {-3, —2.3, —2, -1.5, 0,1, 1.5, 2, 2.75, 3} of [—3,3]. The upper sum U(f, P) and 
the lower sum L(f,P) are illustrated in Figures 6.1 and 6.2, respectively. Observe 
that Az; = —2.3 — (—3) =.7 and Avg = 1.5-1=.5. 


6.1.2 Basic Results Regarding Darboux Sums 


Lemma 6.1.5. Let f: [a,b] + R be a bounded function. Let P = {%9,21,%2,..., Un} 
be a partition of [a,b]. Then for alli = 1,2,...,n, we have m; < Mj. 


Prooj.. Cleatly,.1my = int| 7 ([aj-4, @2))| < aul fj (lea, ey) )) = 1, fort = Loe, 
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It should be clear that for any particular partition, the lower sum will be less 
than or equal to the upper sum; however, we will now formally verify this. 


Lemma 6.1.6. Let f: [a,b] + R bea bounded function. Let P = {%9,71, %2,..., Un} 
be a partition of [a,b]. Then L(f, P) < U(f, P). 


Proof. Let f: [a,b] + R be a bounded function. Let P = {x0,71,%2,...,%n} bea 
partition of [a,b]. By Lemma 6.1.5, m; < M; for all i =1,2,...,n. Hence, 


i=1 11 


In the following two lemmas, we confirm two algebraic identities that will be 
applied later. 


Lemma 6.1.7. Let f: [a,b] + R bea bounded function. Let P = {%9,71, %2,..., Un} 
be a partition of [a,b]. Then U(f, P) — L(f, P) = > (M; — m)Az;. 


i=] 
Proof. By algebra, 


n 


i=1 i=1 


i=l 


Lemma 6.1.8. >> Ax; = b —a for any partition P = {x9, 21, ¥2,...,%n} of [a, O]. 
i=l 


n n 
Proof. Since x9 = a and x, = b, we have 95 Ax; = D3 (a; — 2j-1) = Ln — Zo =b- 
i=l i=l 


(see Exercise 9 on page 27). 


The Refinement Theorem 


Given a partition, we can evaluate the upper and lower sums of a function. We will 
now show that a refinement of this partition will comparatively decrease the upper 
sum and increase the lower sum; that is, by refining a partition “the upper sum will 
go down and the lower sum will go up.” 


Theorem 6.1.9 (Refinement Theorem). Let f: [a,b] + R be a bounded function. 
If P is a partition of [a,b] and P* is a refinement of P, then 


L(f,P) < Lf, P*) < Uf, P*) < UF, P). (6.1) 


Proof. Let f: [a,b] + R be a bounded function. Let P and P* be partitions of [a, j 
where P* is a refinement of P. Let P = {x0,21,%2,...,%n,}. So, the upper sum of f 
with respect to P is 


Lemma 6.1.6 implies that L(f, P*) < U(f, P*), the “middle” inequality of (6.1). We 
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now prove that U(f, P*) < U(f, P). First, suppose that the partition P* contains 
just one more point than P. So P* = PU {a*} and xp_1 < x* < xy for some natural 
number k where 1 < k < n. In equation (6.2), let us “isolate the interval |a,~1, 2%]” 


in the sum (6.2) for U(f, P) by separating the term M;,Az, from the sum >> M;Az,; 
i=l 
and rewriting this sum as 


U(f,P) = | 5° MjAz; | + MpAze. (6.3) 
28 
Recall that M, Ax, = My(x~ — 2p-1) where M; = sup|f([rp_1, 2¢])]. By “isolating 
the two intervals [7,_1,2*] and [x*, x,]” in the sum for U(f, P*), we see that 


U(f, P*) = (= as, +81” — eA) + salte = 2"), (6.4) 

itk 
where s; = sup[f({#,—-1,2*])] and so = sup[f([2*, xx])]. Since s; < My and so < Mg, 
we have that 


81(a* — p_1) + 8o(ap — 2*) < My (a* — wp_1) + Mp (ape — 2*) 


6.5 
= My (xr = Lk-1) = M,Ax,. ( ) 


Hence, (6.3), (6.4), and (6.5) imply that U(f, P*) < U(f,P) when P* contains one 
additional point. Now suppose that P* contains 7 > 1 many more points than P. So 
P* = PU{z},%3,..., aj}. Let Pf = PU {xj,x5,..., 27} whenever 1 <i < j. Thus, 
Pi = P* U{ai}, Px = P*U {27,5}, ..., and P? = P*. For each i < j, the partition 
P;., contains just one more point than P* and thus, the above argument shows that 


v 


U(f, P§.1) < U(f, P*). We conclude that 


Therefore, U(f, P*) < U(f, P). The proof that L(f, P) < L(f, P*) is similar. 


We now show that every lower sum is less than or equal to every upper sum, of 
the function. 


Lemma 6.1.10. Let f: [a,b] > R be a bounded function. If P and Q are partitions 
of [a,b], then L(f,P) < U(f,Q). 


Proof. Let f: [a,b] — R be a bounded function and let P and Q be partitions of 
[a, 6]. Since P* = PUQ is a refinement of P, Theorem 6.1.9 implies that 


LAG FP) SUC a) (6.6) 
Since P* = PUQ is a refinement of Q, Theorem 6.1.9 implies that 

U(f,P*) < U(f,Q). (6.7) 
Hence, (6.6) and (6.7) imply that L(f,P) < U(f,Q). 
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The Boundedness Lemma 


Let f: [a,b] — R be bounded. Is the set of all Darboux sums of f bounded? That 
is, are there real numbers @ and 6 such that a < L(f,P) < U(f,P) < @ for all 
partitions P of [a,b]? The following lemmas answers this question in the affirmative. 


Lemma 6.1.11. Let f: [a,b] + R be a bounded function and let m and M be such 
that m < f(x) < M for all x € [a,b]. Let P be a partition of [a,b]. Then 


m(b— a) < L(f, P) < U(f, P) < M(b—a). 


Proof. Suppose that m < f(x) < M for all x € [a,b], and let P = {29, 71, 22,...,2n} 
be a partition of [a,b]. So, by Lemma 6.1.6, L(f, P) < U(f, P). We now prove that 
U(f, P) < M(b—a). Note that M; < M for all i =1,2,...,n. Hence, 


i=1 


i=l = 


The proof that m(b— a) < L(f, P) is similar. 


So if f: [a,b] + R is bounded, then the sets of real numbers 
{U(f, P): P is a partition of [a,b]} and {L(f, P) : P is a partition of [a,b]} (6.8) 
are also bounded. 


6.1.3 The Definition of the Riemann Integral 


Since Lemma 6.1.11 implies that the sets in (6.8) are bounded, the completeness 
axiom ensures that the following upper and lower integrals of f exist. 


Definition 6.1.12. Let f: [a,b] + R be a bounded function. 


e The upper integral of f on [a,b], denoted by U(f), is defined by 


U(f) =inf{U(f, P) : P is a partition of [a, 5) }. 


e The lower integral of f on [a,b], denoted by L(f), is defined by 
L(f) =sup{L(f, P) : P is a partition of [a, b]}. 
Thus, the upper integral is equal to the infimum of the upper sums and the lower 
integral is equal to the supremum of the lower sums. 


Comment. Let f: [a,b] > R be a bounded function. Given any partition P of [a, 6], 
we always have that L(f,P) < L(f) and U(f) < U(f, P). 
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Let f: [a,b] — R be positive and bounded. Suppose we know that A is the area 
of the region under the graph of f and above the interval [a,b]. As in Figure 6.1, 
one can view each upper sum U(f, P) as an overestimate of A, that is, A < U(f, P). 
Thus, A is a lower bound for the set {U(f,P) : P is a partition of [a,b]}. Hence, 
A <U(f). Similarly, L(f) < A. Thus, L(f) < A < U(f). Therefore, if L(f) = U(f), 
then L(f) = A= U(f). So if L(f) = U(f), then the area under f can be defined to 


be this common value. 


Definition 6.1.13. Let f: [a,b] — R be a bounded function. If L(f) = U(f), we 
say that f is Riemann integrable on [a,b] and we shall denote this common value 


by ry or by f? f(x) de 


We often say that a function f is integrable on [a,b], rather than stating that f 
is Riemann integrable. The term [ ie f is called the integral of f over [a,b]. We now 
present three basic lemmas that establish three useful inequalities. 


Lemma 6.1.14. Let f: [a,b] ~ R be a Riemann integrable function. If P is any 
partition of [a,b], then L(f, P) < Lie i SUF Fh 


) = L(f) = U(f). Let 
P be a partition of [a,b]. As L(f,P) < L(f) and U(f) < U(f,P), (a) implies that 
Lf,P) S Sof <U(hP). 


Lemma 6.1.15. Let f: [a,b] > R be Riemann integrable. Suppose m, M € R are 
such that m < f(a) < M for all « € [a,b]. Then m(b—a) < f? f < M(b—a). 


Proof. Let f: [a,b] + R be Riemann integrable. Let m € R and M € R be such that 
m < f(x) < M for all x € [a, 6]. Lemma 6.1.11 implies that 


m(b—a) < L(f,P) < U(f, P) < M(b—a) 


for any partition P of [a,b]. Therefore, Lemma 6.1.14 now implies that 


we~ayS [i < M(b—a). 


Lemma 6.1.16. Let f: [a,b] > R be a Riemann integrable function. Suppose that 
M > 0is such that |f(x)| < M for all x € [a,b]. Then < M(b- a). 


Proof. Let f: [a,b] + R be Riemann integrable such that | f(x)| < M for all x € [a, b], 
where M > 0. Thus, —M < f(x) < M for all x € [a,b]. Lemma 6.1.15 implies that 


M(b-—a) < [F< Mb-a) 


Consequently, 


fe f| < M(b—a) by Theorem 2.2.6(c). LU 
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6.1.4 A Necessary and Sufficient Condition 


To formally prove that a function is Riemann integrable, it can be difficult to apply 
Definition 6.1.13. Fortunately, there is a more accessible method for showing that a 
function is Riemann integral. In this section we shall establish an important necessary 
and sufficient condition (see Theorem 6.1.18) which we will use to show that many 
functions are Riemann integrable. 


Theorem 6.1.17. Let f: [a,b] —~ R be a bounded function. Then L(f) < U(f). 
Proof. Let f: [a,b] > R be bounded. Recall that 

L(f) =sup{L(f, P) : P is a partition of [a, bj} 

U(f) = inf {U(f, Q) : Q is a partition of [a, bl}. 


Lemma 6.1.10 implies that L(f,P) < U(f,Q) for all partitions P and Q of [a, }]. 
Theorem 2.3.17 implies that L(f) < U(f). 


From the above theorem, we see that L(f,P) < L(f) < U(f) < U(f,P) for all 
partitions P. This indicates that a function is Riemann integrable if and only if one 
can find partitions whose upper and lower sums are arbitrarily close together. 


Theorem 6.1.18 (Necessary and Sufficient Condition). Let f: [a,b] ~ R be a 
bounded function. Then f is Riemann integrable if and only if for every « > 0 
there exists a partition P of [a,b] such that U(f, P) — L(f, P) <e. 


Proof. Let f: [a,b] + R be a bounded function. 
(=). Assume that f is Riemann integrable. So, L(f) = U(f). Let ¢ > 0. Recall that 


L(f) =sup{L(f, P): P is a partition of [a, bj} (6.9) 

U(f) = inf {U(f, Q) : Q is a partition of [a, bl}. (6.10) 
By (6.9) there is a partition P’ such that L(f) — L(f,P’) < §, and by (6.10) there 
is a partition Q such that U(f,Q) — U(f) < §.1 Let P = P’UQ. Because P is a 
refinement of P’, Theorem 6.1.9 implies that L(f, P’) < L(f,P) < L(f) and since 
L(f) — L(f, P’) < 5, we conclude that 


€ 
L(f) — L(f,P) <5: (6.11) 
Similarly, Theorem 6.1.9 implies that 
€ 
UUf,P)-U(f) <5. (6.12) 


Since L(f) = U(f), by adding (6.11) and (6.12), we have that U(f, P) -—L(f, P) <. 


'See Exercises 7 and 8 on page 46. 
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(<=). Suppose that for every « > 0 we have U(f, P) — L(f, P) < € for some partition 
P of [a, 6]. Let ¢ > 0. Thus, (a) U(f, P) — L(f, P) < « for some partition P. Since 


LU(f,P) < Lf) < Uf) < UF, P), 


we conclude? that U(f)—L(f) < U(f, P)—L(f, P). So, by (a), 0 < U(f)-L(f) <«. 
By Corollary 2.2.4, we infer that U(f) = L(f). Thus, f Riemann integrable. 


Definition 6.1.18 and Theorem 6.1.18 present us with two methods for proving 
that a function is Riemann integrable on a given interval. 


Proof Strategy 6.1.19. Let f be a bounded function. To prove that f is Riemann 
integrable on the interval [a,b], use one of the following two proof strategies: 


(a) Apply Definition 6.1.13 directly and prove that L(f) = U(f). 
(b) Apply Theorem 6.1.18 by using the proof diagram: 
Let ¢ > 0 be arbitrary. 


Let P be the partition of [a,b] that you found. 
Prove U(f, P) — L(f, P) <e. 


Strategy 6.1.19(a) requires one to prove something about all partitions using 
Definitions 6.1.13 and 6.1.12, whereas Strategy 6.1.19(b) just involves finding one 
relevant partition. For this reason, Strategy 6.1.19(b) is typically easier to apply. 


Example 6.1.20. Let g: [0,1] > R be defined by 
1, ifa<1; 
i) — 
(2) i ife=1. 


We will use Proof Strategy 6.1.19(b) to show that g is Riemann integrable. Let ¢ > 0. 
Let n € N be such that + < min{e,1} and let P = {0, +, 1}. We will show that 
U(g, P) — L(g, P) < ¢. Since 


we see that U(g, P) — L(g, P) = 4 <«. Thus, g is Riemann integrable. 


Example 6.1.20 presents a function that is continuous everywhere except at an 
endpoint. It is then shown that this function is Riemann integrable. The argument 
used in this example, inspires the proof of our next theorem; but, before proving this 
theorem, we identify a relevant assumption strategy. Whenever we have a function 
that is Riemann integrable, Definition 6.1.13 and Theorem 6.1.18 allow us to reach 
two conclusions that can be applied in a proof. 


"Ife <a<y<d, then y—x<d-c. 
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Assumption Strategy 6.1.21. Let f: [a,b] — R be bounded. If f is Riemann 
integrable on [a,b], then both of the following assertions hold: 


(a) L(f) =U(f). 
(b) For any ¢ > 0, there exists a partition P of [a,b] such that U(f, P)-—L(f, P) <e. 


Theorem 6.1.22. Let f: [a,b] + R be bounded. If f is Riemann integrable on [c, }] 
for every c € (a,b), then f is Riemann integrable on [a,b]. A similar result holds for 
the endpoint. b. 


Proof. Let f: [a,b] + R be bounded and assume that (A) f is Riemann integrable on 
[c, 6] for all c € (a,b). Let ¢ > 0. Let S = sup|f({a, b])] and let T = inf[f({a, b])]. Let 
n € N be such that S-T < § anda++4 <b. By (a) and Theorem 6.1.18, there exists 
a partition P, of [a+ +,b] such that U(f, Pi) — L(f, Pi) < §. Now let P= {a} UP. 
Clearly, P is a partition of [a,b]. One can now show that U(f, P)—L(f, P) < e. Thus, 
by Theorem 6.1.18, f is Riemann integrable on |a, }J. 


In Section 6.2.3, we will show that a function is integrable on |a, b] if and only if 
it is integrable on [a,c] and [c, }] for all c € (a,b). Thus, Theorem 6.1.22, together 
with an induction argument, shows that a bounded function on [a,b] having just a 
finite number of discontinuities is Riemann integrable. 

Moreover, to show that a function f is not Riemann integrable on |a, b], one can 
apply the negation of 6.1.21(b). To do so, one must identify an ¢ > 0 and show that 
U(f, P) — L(f, P) > for every partition P of [a, 0]. 


Example 6.1.23. Let h: [0,1] — R be defined by 


1, if 
h(z) = » ULE Q; 
0, ifx¢gQ 
We show f is not Riemann integrable. Let ¢ = 5 and ley P= 445.94, Wa, 4 ote} 


be any partition of [0,1]. Since Q and R \ Q are dense in R, we see that M; = 1 
aid ig = 0 tor-each t= 1, 2.4..,7 Vis, VP) = > Maa = Ae = 1 and 
i=l i=l 


L(h, P) = > m Az; = 0. So U(h, P) — L(h, P) =1> 4. Thus, h is not integrable. 
i=1 

Example 6.1.23 shows that a function with an infinite number of discontinuities 
may not be Riemann integrable. 


Remark. Georg Friedrich Bernhard Riemann (1826-1866) was famous for his work 
in non-Euclidean geometry, differential equations, and number theory. His results in 
physics and mathematics form the basis of Einstein’s theory of general relativity. 


Exercises 6.1 


1. Let f: [0,2] — R be defined by f(x) = x? —x whose graph appears in Figure 6.3. 
Consider the partition P = {0, 4,1, 3,2} of [0,2]. Find U(f, P) and L(f, P). 


120 y Oo 


*3. 


*A, 
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Figure 6.3: Graph of f(x) = x? — x for Exercise 1. 


. Let c be a constant. Suppose that f(x) = c for all x in [a,b]. Prove that f is 


Riemann integrable and that pe f =c(b—-a). 

Let f: [a,b] — Rand g: [a,b] > R be bounded. Assume that f(x) < g(x) for all 

x € |a, b]. Show that 

(a) U(f,P) < U(g, P) for every partition P of [a, }]. 

(b) L(f, P) < L(g, P) for every partition P of [a, }]. 

(c) U(f) < U(g) and L(f) < L(g). 

Let f: [a,b] — R be bounded and let c be a constant. Define g: [a,b] + R by 
gz) = f(a) +c. Show that 

(a) U(g, P) =U(f, P) + c(b — a) for every partition P of [a, 6]. 

(b) L(g, P) = L(f, P) + c(b — a) for every partition P of [a,b]. 

(c) U(g) = U(f) + eb — a) and L(g) = L(f) + c(b— a). 


. Let f: [0,1] — R be defined by 


1, if0<2<1:; 


n= {> P= 


Show that f is Riemann integrable and find i f 


6. Complete that proof of Theorem 6.1.22 by showing that U(f,P) — L(f, P) <e 


. Let f: [0,1] — R be defined by 


Ay ifxEeQ 
f= 1", ifc¢éQ. 


Show that f is not Riemann integrable. 


. Let f: [a,b] + R be bounded and let K € R. Prove that if for every ¢ > 0 there is 


a partition P of [a,b] such that U(f, P)-L(f, P) < eand L(f, P) <K < U(f,P), 
then f is Riemann integrable and [ in f=kK. 


. Let f: [a,b] + R be bounded. Suppose that there is a sequence (P,,) of partitions 


of [a,b] such that 


noo 
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(a) Prove that f is Riemann integrable. 


(b) Prove that lim Lf, Fi) = Si and lim Utica = ie fe 
10. Let f: [0,1] — [0,1] be defined by 


0, ifxER\Q 
f(t) = 44, ifeeQ* and x= @ is in reduced form; 
1, if#=0; 


for each x € [0,1], where m and n are natural numbers. One can show that f is 
Riemann integrable as follows: Let ¢ > 0 be such that ¢ < 1. 


(a) Let S = {@ € [0,1]: 2 € Q (in reduced form) and 4+ > §}. Show that 
S#@ and S is finite. 

(b) Let N = #S > 1, where #S denotes the number of elements in the finite set S. 
Let P = {z0,%1,...,%,} bea partition of [0, 1] such that POS = {z,} = {1} 
and Aa; < 54 for alli = 1,2,...,n. Break up the set {1,2,...,n} into the 
following two disjoint parts: 


A= {i le) OS Sh and B= {te a SH eh. 


(1) Show that >? M;Aa; < §. Note: 1 < #A < N, and M; <1 for alli ¢€ A. 
icA 
(2) Show that }° M;Az; < 5. Note: #B <n, and M; < § for alli ¢ B. 
iE€B 
Conclude that U(f, P) — L(f, P) <e. 
Exercise Notes: For Exercise 3(a)(b), use Exercise 20 on page 47. For Exercise 4, 
use Exercise 13 on page 46. For Exercises 5 and 7, use proof strategy 6.1.19(b). For 
Exercise 9, use Exercise 11 on page 74. 


6.2 PROPERTIES OF THE RIEMANN INTEGRAL 


In the previous section, the focus was on the definition of the Riemann integral and 
on a useful necessary and sufficient condition for its existence. This section will be 
directed toward deriving the important properties of the Riemann integral. 


6.2.1 Linearity Properties 


In this section, we will identify and prove several algebraic properties of the integral. 
Exercise 11 on page 46 and Definition 6.1.3 imply the following result, which will be 
used to prove the constant multiple rule for the Riemann integral. 


Lemma 6.2.1. Suppose that f: [a,b] > R is bounded and let k € R. Let P = 
{x0,%1,22,...,%n} be an arbitrary partition of [a,b]. If k > 0, then 


1. Mi(kf) = kM,(f) and m,(kf) = km,(f), for alli =1,...,n. 
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2. Utky,.P) SkUl sf, P) and LAAT, P) = kif, 2): 

If k < 0, then 

1. Mi(kf) = kmi(f) and mi(kf) = kM; (f), for alli =1,...,n. 
2. Utkf,, P)SkL GP) and LRT, P) =k fP). 


Lemma 6.2.2 (Constant Multiple Rule). Suppose that f: [a,b] > R is Riemann 
integrable and let k € R. Then kf is Riemann integrable and [ ig kf=kf 2 f. 


Proof. Let f: [a,b] + R be Riemann integrable and let k € R. We consider the cases 
(1) k > 0 and (2) k < 0 separately. 


CASE 1: k > 0. First let P be a partition of [a,b]. Lemma 6.2.1 implies that 
Ulkp, P= kU (FG, P) and TARF, P)= kL, P): (6.13) 
Recalling Definition 6.1.12, we see from (6.13) that 
U(kf) =inf{kU(f, P) : P is a partition of [a, b]} 
L(kf) =sup{kL(f, P) : P is a partition of [a, b]}. 


Theorem 2.3.13 implies that (i) U(kf) = kU(f) and (ii) L(kf) = kL(f). Since f 
is Riemann integrable, we know that U(f) = L(f). Therefore, (i) and (ii) imply 
that U(kf) = L(kf) and hence, kf is Riemann integrable. In addition, because 
U(kf) = kU(f), we have that fokf =k? f. 


CASE 2: k < 0. The proof is analogous to the proof of Case 1 and is Exercise 2. 


The following lemma will be used to prove the sum rule for integration. 


Lemma 6.2.3. Let f: [a,b] + R and g: [a,b] + R be bounded. For any partition 
P of [a, 6], we have that 


L(f,P)+L(g,P) < Uf +9,P) <U(f+9,P) < Uf, P)+U(g, P). 


Proof. Let f: [a,b] + R and g: [a,b] — R be bounded and let P = {2%0,21,...,2n} 
be a partition of [a,b]. For each i = 1,...,n, letting D; = [x;_-1, xi], Definition 6.1.3 
and Theorem 2.3.16 imply that 


mi(f) + mi(g) = inf(f[Di]) + inf(gDi]) < inf((f + g)[Di]) = mi(f + 9) 
Mi(f + 9) = sup((f + g)[Di]) < sup(f[Di]) + sup(g[Di]) = Mi(f) + Mi(g). 


Thus, m;(f)+m:(g) < m(f+g) and M;(f+g) < Mi(f)+Mi(g) for alli =1,...,n. 
Hence, by Definition 6.1.3, we have 


L(f,P) + L(g, P) < L(f + 9,P) and U(f +9, P) < U(f, P) + U(g, P). 


Since L(f +9, P) < U(f +9, P), we conclude that 


L(f,P) + L(g,P) < Lf +9,P) <U(f+9,P) < UF, P) + U(g, P). 
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Lemma 6.2.4 (Sum Rule). If f: [a,b] — R and g: [a,b] > R are both Riemann 
integrable, then f + g is Riemann integrable and CU + g)= te ft ie g. 


Proof. Let f: [a,b] — R and g: [a,b] > R be Riemann integrable. We prove the 
theorem in two steps. 


STEP 1: We will prove that f+ is Riemann integrable by applying Theorem 6.1.18. 
Let ¢ > 0. Since f and g are Riemann integrable, Theorem 6.1.18 implies that there 
are partitions P,; and P 2 such that 


Uh Pi) = bE Pay (6.14) 


U(g, Px) — L(g, Pa) < (6.15) 


NO] mM dO] 


Let P* = P, U Py. Since P* is a refinement of P, and Py, Theorem 6.1.9 implies that 


LF,Pi) < LF, P*) < UG, P*") s UF, Py) 
L(g, P2) < L(g, P*) < U(g, P*) < U(g, Pa) 
and thus, (6.14) and (6.15) respectively imply? that 
U(f, P*) — L(f, P*) < 5 (6.16) 
U(g, P*) — L(g, P*) < ; (6.17) 
Hence, (6.16) and (6.17) imply that 
U(f, P*) + U(g, P*) — (L(f, P*) + L(g, P*)) <e. (6.18) 


By Lemma 6.2.3, we have that 

L(f, P*) + L(g, P*) < L(f+9,P") <U(f+g9,P*) <UG,P*")+U(g, P*). (6.19) 
Thus, from (6.19) and (6.18), we conclude that U(f + g, P*) — L(f +9, P*) < «. 
Therefore, Theorem 6.1.18 implies that f + g is Riemann integrable. 


STEP 2: Now we shall prove that lige; +g) = ig + fg. Let ¢ > 0. Using the 
same argument as in Step 1, there is a partition P* satisfying (6.18) and (6.19). By 
Lemma 6.1.14 we also have that 


b 
BGP) a) FUG") (6.20) 
L(g, P*) < [fs SU(GP*) (6.21) 
Lg +9P)< ff +9) SU +9,P") (6.22) 


"Ife<a<y<dandd—c< §, theny—z< &. 
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By adding corresponding sides, (6.20) and (6.21) imply the next inequality (6.23). 
Also, (6.22) and (6.19) imply the subsequent inequality (6.24) 


b b 
LEP) +LG.P)< [f+ fgsUPy+uG.P) (6.23) 
WLP) +LG.P)< [+9 SULP)+UG.P). (6.24 


a 


Inequalities (6.18), (6.23), and (6.24) now imply* that 


[r+ fo- fer 


Corollary 2.2.4 now implies that ae +g)= Hs f+ i. g. 


<€. 


Lemma 6.2.2 and Lemma 6.2.4 establish two important algebraic properties of 
the Riemann integral. We now formally unite these two results into a single theorem. 


Theorem 6.2.5 (Linearity Theorem). Suppose that f: [a,b] > R and g: [a,b] > R 
are Riemann integrable and let k € R. Then 


(a) kf is Riemann integrable and i kf = ky f. 
(b) f +g is Riemann integrable and [°(f +9) = [? f+ f2g. 


Proof. Lemma 6.2.2 implies (a) and Lemma 6.2.4 implies (b). 


Corollary 6.2.6 (Difference Rule). Suppose that f: [a,b] + R and g: [a,b] — Rare 
Riemann integrable. Then f — g is Riemann integrable and CE —g)= ih f- fe g- 


Proof. By Theorem 6.2.5(a), —g is Riemann integrable. Since f — g = f + (—g), we 
have that f — g is integrable and ff —g)= ie f- i g, by Theorem 6.2.5. 


6.2.2 Order Properties 
We will show that Riemann integration preserves an order relation on functions. 


Theorem 6.2.7. Suppose that f: [a,b] + Ris Riemann integrable and that f(x) > 0 
for all w € [a,b]. Then [? f > 0. 


Proof. Let f: [a,b] + R be Riemann integrable where f(x) > 0 for all x € [a,}]. It 
follows that 0 < L(P, f) for every partition P of [a,b]. Thus, 0 < L(f). Since f is 
Riemann integrable, we have that [ 2 f = L(f). Therefore, [ : f>0. 


Corollary 6.2.8. Let f: [a,b] > R and g: [a,b] ~ R be Riemann integrable where 
f(a) < g(a) for all x € [a,b]. Then f? f < f?g. 

Proof. Suppose that f: [a,b] > R and g: [a,b] > R are Riemann integrable and that 
f(x) < g(x) for all x € [a,b]. Define the function h: [a, 6] > R by h(x) = g(x) — f(z). 
Clearly, h(x) > 0 for all x € [a, 6]. Corollary 6.2.6 implies that h is Riemann integrable 
and Theorem 6.2.7 implies that 0 < fe h. Thus, 0 < CG —g) and Corollary 6.2.6 
implies that (cz x Hh g. 


“lfc <a,y <dandd—c<e, then |x — y| <e. 
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6.2.3 Integration over Subintervals 


We will show that if a function is integrable on a closed interval [a, b], then it is also 
integrable on any closed subinterval of [a,b]. This will allow us to split an integral 
into two parts (see Theorem 6.2.11). 


Theorem 6.2.9. Let f: [a,b] — R be Riemann integrable on [a,b]. Let c be such 
that a <c <b. Then f is integrable on [a,c] and on [c, }]. 


Proof. Suppose that f is Riemann integrable on [a,b] and a < c < b. Let ¢ > 0. Since 
f is Riemann integrable on [a,b], there is a partition Q of [a,b] such that 


U(f,Q) — L(f,Q) <«. 


By Theorem 6.1.9, we can assume that c € Q. Let P= QN [a,c] and P’ = Qn |[c, bl. 
Thus, P is a partition of [a,c], P’ is a partition of [c, b], and Q = PU P’. Hence, 


U(f,Q) =U(F,P)+U(F, P’) 
L(f,Q) = L(f, P) + LF, P’), 


and thus, 
U(F,Q) — L(f,Q) = UGE, P) -— LF, P)) + (UF, P’) - LG, P<. 
We conclude that U(f, P)— L(f, P) < ce and U(f, P’) — L(f, P’) < «. Theorem 6.1.18 


thus implies that f is Riemann integrable on [a,c] and on [c, J. 


Corollary 6.2.10. Let f: [a,b] + R be Riemann integrable on [a,b]. Then for all 
c,d where a <c<d< b, we have that f is integrable on [c, d]. 


Proof. See Exercise 3 (apply Theorem 6.2.9 twice, at most). 


We now prove the converse of Theorem 6.2.9 


Theorem 6.2.11. Suppose that f: [a,b] + R is Riemann integrable on [a,c] and on 
[c, 6], where a < c < b. Then f is Riemann integrable on [a, b] and or = Lae f: 


Proof. Let f: [a,b] + R be Riemann integrable on [a, c] and on [c, b], where a < c < b. 
The proof will be done in two steps. In Step 1, we prove that f is Riemann integrable 
on [a,b], and in Step 2 we prove i = ie oe 

STEP 1: Let ¢ > 0. Since f is Riemann integrable on [a,c] and on [c,b], Theorem 


6.1.18 implies that there are partitions P; of [a,c] and P, of [c, b] such that 


U(f, Pi) — Lf, Pi) < (6.25) 


DO] M dO] M 


U(f, Pa) — Lf, Pe) < (6.26) 
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Let P = P, U Py, which is a partition of [a,b]. Since P, and P, both have c as an end 
point, we conclude that 
L(f, Pi) + Lf, Pa), (6.28) 


& 

oS 

a) 
T 


and thus, 
U(f,P) — LF, P) = (U(f, Pi) — UF, Pi) + (UE, Pa) - LF Pa) < 5 +5 =. 


So U(f, P) —L(f, P) < «. Thus, f is Riemann integrable on [a, b] by Theorem 6.1.18. 


STEP 2: Now we prove that bee = Aad cen f. Let ¢ > 0. Arguing as in Step 1, there 
are partitions P,, P,, and P = P, U P, that satisfy (6.27), (6.28), and the inequality 
U(f, P) — L(f, P) < «. Furthermore, Lemma 6.1.14 implies that 


LE, Pays [fs SUF Pi) (6.29) 
L(f, Po) < is <U(f, Po) (6.30) 
wg,P) < f F< UEP) (6.31) 


After adding (6.29) and (6.30), equations (6.28) and (6.27) imply that 


1G, P) =H PSL Pa) < fet fF SOU PSO) = OUP). (6.32) 


Since U(f, P) — L(f, P) < €, inequalities (6.31) and (6.32) imply? that 


(fr fy-L5 


Since € > 0 was arbitrarily chosen, we must have if f=fift+ vik f 


sa 


The Riemann integral is defined for a function whose domain is identified to be 
a closed interval [a,b], where a < b. We now introduce some standard conventions. 


Definition 6.2.12. Let f: [a,b] + R be Riemann integrable. Define [* f = 0 and 
a att ee 
Definition 6.2.12 and Theorem 6.2.11 now imply the following corollary. 


Corollary 6.2.13. Let f be integrable on every closed subinterval of an interval I. 


Let a,b,c be in J. Then 
b c b 
prafrefet (6.33) 


"If d<a,y <eand e—d<e, then |x —y| <e. 
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Proof. Let f be integrable on any closed subinterval of an interval J. Let a, b,c be in I. 
Ifa=b, b= c, or a=c, then (6.33) is easy to verify, using Definition 6.2.12. Assume 
that the points a,b,c are all distinct. These three points determine a subinterval 
of J; namely, the interval J = [min{a, b,c}, max{a, b,c}], where a,b,c are in J. By 
assumption, f is integrable on the closed interval J. To prove (6.33), we must consider 
all the possible orderings of a, b, c. However, we will just consider one such ordering, as 
the proof is similar for all of the other orderings of a, b,c. So, suppose that b<a<c. 
Then, by Theorem 6.2.11, we have that (a) ff f = ff f+ fof. Since b <a<e, 
Definition 6.2.12 and (4) imply that ey = =a hei + f° f. Thus, by algebra, we 
have that f? f= fo ft J? f. 


6.2.4 The Composition Theorem 


Theorem 4.1.13 implies that the composition of continuous functions is continuous. 
Moreover, Theorem 5.1.5 (the chain rule) shows that the composition of differentiable 
functions is differentiable. Is the composition of integrable functions integrable? The 
answer is, unfortunately, “not always.” There are examples of integrable functions 
whose composition is not integrable (see Exercise 10). However, our next theorem 
identifies a condition under which the composition of Riemann integrable functions 
will be Riemann integrable. 


Theorem 6.2.14 (Composition Theorem). If f: [a,b] > [c,d] is Riemann integrable 
and g: [c,d] > R is continuous, then (go f): [a,b] > R is Riemann integrable. 


Proof. See Appendix A on page 235. 


The Composition Theorem is a very useful result. For example, it implies the 
following corollary which states that the square of a Riemann integrable function is 
also Riemann integrable. 


Corollary 6.2.15. If f: [a,b] + R is integrable, then f? is integrable on [a, J. 


Proof. Assume f is Riemann integrable. Thus, f is bounded. So, there exists c,d € R 
such that f({a,b]) C [c,d]. Let g: [c,d] + R be defined by g(x) = 2?. Since g is 


continuous (Exercise 5 on page 108), Theorem 6.2.14 implies that go f is integrable 
on [a,b]. Clearly, go f = f? and thus, f? is Riemann integrable. 


Another important consequence of the Composition Theorem is that the product 
of Riemann integrable functions is Riemann integrable. 


Corollary 6.2.16. If f: [a,b] — R and g: [a,b] — R are integrable, then fg is 
integrable on {a, }]. 
Proof. Using algebra, we have fg = +I(f + 9)? —(f —g)?]. As f and g are Riemann 


integrable, Theorem 6.2.5, Corollary 6.2.6, and Corollary 6.2.15 imply that fg is 
Riemann integrable. 


The Composition Theorem also implies that the absolute value of a Riemann 
integrable function is Riemann integrable. 
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Corollary 6.2.17. If f: [a,b] — R is integrable, then |f| is integrable on [a, 6] and 


(eae tabi 


Proof. Let f be integrable. So f is bounded. Let c,d € R be such that f({a, b]) C [c, d]. 


Let g: [c,d] > R be defined by g(x) = |x|. As g is continuous (Exercise 4 on page 108), 
Theorem 6.2.14 implies that go f is integrable on [a, 6]. Clearly, go f = |f| and so, |f| 
is integrable. Since —|f(x)| < f(x) < |f(x)| for all x € [a,b], Corollary 6.2.8 implies 


that - If| < pes S i | f|. Therefore, 


OB alea alka 


Exercises 6.2 


1. 
*2. 
*3. 

A. 


Prove Lemma 6.2.1. 
Complete the proof of Lemma 6.2.2, by proving the lemma under the case k < 0. 
Prove Corollary 6.2.10. 


Find a function f so that |f| is integrable on an interval [a,b] but f is not. 


5. Let f: [a,b] + R be Riemann integrable. Let k > 0 be such that f(x) > k for 


*10. 


"11. 


. Let f: [a,b] — R, h: [a,b] > R, g: [a,b] — R be such that f(x) < h(x) < 9(2) 


all x € [a,b]. Prove that ; is Riemann integrable on {a, }]. 


. Let f: [a,b] > R be Riemann integrable. Suppose that f(a) > 0 for all x € |a, 0). 


Prove that \/f is Riemann integrable on [a, 0}. 


for all x € [a,b]. Suppose that f and g are integrable on [a,b] and fr 
Prove that h is integrable and f?h = f° f. 


Jeo: 


. Let f and g be Riemann integrable on [c, d]. Prove that 


fols[Lr)(L2)] - 


. Let f and g be Riemann integrable on [c, d]. Prove that 


[vor ae (oe (fs). 


Let f: [0,1] — [0,1] be the integrable function defined in Exercise 10 on page 162. 
The function g: [0,1] + R defined in Example 6.1.20 on page 159 is integrable. 
Show that go f = h, where h is defined in Example 6.1.23 on page 160. Conclude 
that go f is not Riemann integrable on [0, 1]. 

Let g: [a,b] > R be continuous, and let f: [a,b] — [a,b] be defined by f(x) = x. 
Show that if f Riemann integrable, then g is Riemann integrable. 


Exercise Notes: For Exercise 1, use Definition 6.1.3 and Theorem 2.3.13. For Exer- 
cise 4, consider the function f: [0,1] + R in Exercise 7 on page 161. For Exercise 5, 
apply Theorem 6.2.14. For Exercise 7, see Exercise 3 on page 161. For Exercise 8, first 
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note that (yf +g)? > 0 for any real number y. Thus, one can apply Theorem 6.2.7. 
Expand the integral Chi + g)?. By algebra, if ay? + by +c > 0 for all y, then 
the equation ay? + by + ¢ = 0 can have at most one real root. So, by the quadratic 
formula, we must have b? — 4ac < 0. For Exercise 9, expand the integral on the left 
side of < and then use Exercise 8. Also note that a+2,/avVb+ b = (/a+ Vb)? when 
a,b > 0. For Exercise 11, use Theorem 6.2.14. 


6.3. FAMILIES OF INTEGRABLE FUNCTIONS 


It is useful to identify certain functional properties which, if satisfied, will ensure that 
a function is integrable. In this section, we show that continuous functions, monotone 
functions, and functions of bounded variation are Riemann integrable. We also derive 
a mean value theorem for integrals. 


6.3.1. Continuous Functions 


Theorem 6.2.14 indirectly implies that a continuous function defined on a closed inter- 
val is Riemann integrable (see Exercise 11 on page 169). Nevertheless, we will present 
a direct proof of this important result. In our proof, we will apply Theorem 6.1.18. 


Theorem 6.3.1. If f: [a,b] > R is continuous, then f is integrable on [a, }]. 


Proof. Let f: [a,b] + R be continuous. By Theorem 4.4.1 f is bounded. Let ¢ > 0. 
As f is uniformly continuous on [a,b] by Theorem 4.5.5, there is a 6 > 0 such that 
E 

b-a 
Let P = {x0,%1,%2,...,%n} bea partition of [a, b] such that (a) Ax; = (4;—2;_-1) < 6 
for each 7 = 1,2,...,n. Now, for each such i, let M; = M;(f) and m; = m,(f) be as 
in Definition 6.1.3. By Theorem 4.4.3, f attains its maximum and minimum values 
on each subinterval [x;-1,x;]. Thus, for each i = 1,2,...,n, let ti, s; € [xj-1, xi] be 
such that (m) M; = f(t;) and m; = f(s;). Clearly, |t; — s;| < (a; — zj-1) = Ax; < 6 
by (a). Hence, by (6.34), we have (¥) f(ti) — f(si) < ;&, for alli = 1,2,...,n. Thus, 


for all x, 2" € [a,b], if |z — 2"| < 6, then | f(x) — f(2")| < 


(6.34) 


U(f,P) — L(f, P) = So (M; — mi) Az; by Lemma 6.1.7 


i=1 


= \ (F(t) — f(si))Aay by (») 


se 
< S- i “fi by (¥) 
i=1 
= pan by algebra 
“_(h-a)=e by Lemma 6.1.8 
= —-a) = Pe Eo 13 
eas y Lemma 


Theorem 6.1.18 now implies that f is Riemann integrable on {a, }]. 
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Corollary 6.3.2. Let k € R and f: [a,b] + R be the constant function defined by 
f(x) =k for all x in [a,b]. Then f is Riemann integrable and ihe f =k(b-a). 


Proof. Since f is continuous, it is integrable. Because k < f(x) < k for all x in [a, 6], 
Lemma 6.1.15 implies k(b — a) < f° f < k(b—a) and therefore, [? f = k(b— a). 


We can now show that the integral of a positive continuous function f over [a, | 
is equal to the area of a rectangle with base [a,b] and height f(c) for some c € [a, }]. 


Theorem 6.3.3 (Mean Value Theorem for Integrals). If f: [a,b] + R is continuous, 
then there is a point c in [a,b] such that fs f = f(o)(b—- a). 


Proof. Let f: [a,b] + R be continuous. If f is a constant function, the result follows 

from Corollary 6.3.2. Assume that f is not constant. By Theorem 4.4.3, f attains a 

maximum value M and a minimum value m on [a,b]. Thus, f(i) =m and f(j) = M 

for some i,j € [a,b]. As m < f(x) < M for all x in [a, 6], Lemma 6.1.15 asserts that 
b 


m(b—a) < fay < M(b-— a). Hence, m < jet < M. Theorem 4.4.6 implies that 
b 


fie= dat for some c in [a,b]. So sia = f(c)(b—a). 


We end this section with four applications of Theorem 6.3.3. The first three of 
these applications show how a strict inequality that holds for a function, affects the 
integral of the function. 


Corollary 6.3.4. If f: [a,b] > R is continuous and f(x) > 0 for all x in [a, 6], then 
PFS: 

Proof. Let f: [a,b] — R be continuous such that f(x) > 0 for all x in [a, 6). By 
Theorem 6.3.3, {of = f(c)(b— a) for some c in [a,b]. Since a < b and f(c) > 0, we 
conclude that _f . f >0. 


Corollary 6.3.5. Let f: [a,b] + R be continuous and f(x) > 0 for all x in [a, 0]. If 
f (xo) > 0 for some zo € [a,b], then ieee > 0. 


Proof. Let f: [a,b] + R be continuous, f(x) > 0 for all x in [a,b], and f(x) > 0 
for some Xo € [a,b]. Lemma 4.1.14 implies that there are real numbers c,d such that 
a<c<2a<d< band f(x) > 0 for all x in [c,d]. Corollary 6.2.10 asserts that f is 
integrable on [c,d]. Thus, fe f > 0 by Corollary 6.3.4. Corollary 6.2.13 implies that 
SOF H=SoFH+STF +S) f. From Theorem 6.2.7 we infer that f? f > 0. 


Corollaries 6.3.5 and 6.2.6 imply the following result. 


Corollary 6.3.6. Let f: [a,b] — R and g: [a,b] — R be continuous such that 
f(x) < g(x) for all x in [a,b]. If f(x) < g(xo) for some zo € [a,b], then Ae <i ee g. 
Corollary 6.3.7. Let f: [a,b] + R be continuous. If Hs f =Oand f(x) > 0 for each 
x in [a,b], then f(x) = 0 for all z in [a, 8). 


Proof. Let f: [a,b] + R be continuous, ie = 0, and f(x) > 0 for all x in [a, }). 
Suppose, for a contradiction, that f(x) > 0 for some x € [a,6]. Corollary 6.3.5 
implies that f[ hi f > 0 which contradicts the fact that [ i f=0. 
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6.3.2 Monotone Functions 


We now know that every continuous function, defined on a closed interval [a, }], is 
Riemann integrable. The set of all monotone functions on an interval [a,b] is another 
large collection of Riemann integrable functions. Moreover, there are many monotone 
functions that are not continuous. Recall that a monotone function is one that is either 
decreasing or increasing. 


Theorem 6.3.8. If f: [a,b] + Ris monotone, then f is Riemann integrable on [a, 6]. 


Proof. Let f: [a,b] — R be increasing. Thus, f(a) < f(x) < f(b) for all x € |a, }]. 
So f is bounded. Let ¢ > 0. Let & > 0 be such that (a) k(f(b) — f(a)) < © and 
P = {x9,1,%2,...,%} be a partition of [a,b] such that (») Az; = (aj — x1) <k 
for each i = 1,2,...,n. Since f is increasing, we have that (W) m; = f(a:-1) and 
M; = f(%;) forall’ = 1,2)... cn Thus; 


i=l 
25 nee by () 


e Sire) — f(2i1))k by (¥) 


i=1 
=kYO(f(#i) — f(ai-1)) by algebra 
i=1 
= k(f(b) — f(a)) by Exercise 9 on page 27 
<eé by (a). 


Thus, Theorem 6.1.18 implies that f is Riemann integrable. If f is decreasing, then 
—f is increasing and hence, —/f is integrable by the above argument. Corollary 6.2.2 
now implies that f is Riemann integrable. 


6.3.3 Functions of Bounded Variation 


We will now investigate the concept of a function having bounded variation. Such 
functions play a important role in the theory of integration and are closely related to 
monotone functions. The concept of a partition of an interval (see Definition 6.1.1) 
is needed in order to define a function of bounded variation. Our main goal in this 
section is to show that a function of bounded variation is Riemann integrable. 


Definition 6.3.9. Let f: [a,b] + R and let [c, d] be a closed subinterval of |a, b]. Let 
P be a partition of [c,d]. The P-variation of f over [c,d], denoted by V(f, P, [c, d]), 
is defined to be the real number 


n 


V(f,P,le,d)) = Se |f(xi) — f(zi-1)| 


1 


where P= {27;:0<i <n}. 
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One can increase a given variation of a function by using a refinement. 


Lemma 6.3.10. Let f: [a,b] — Rand [c,d] be a closed subinterval of [a,b]. Let P be 
a partition of [c, d]. If P* is a refinement of P, then V(f, P, [c,d]) < V(f, P*, |e, d]). 
Proof. Let f: [a,b] > R, [c,d], and P be as stated. Let P = {xo,%1,%2,...,2n} and 
P* be a refinement of P. So, the P-variation of f over [c, d] is 


n 


V(F,P.led]) = do \f (ai) — f(ai-)]. (6.35) 
1 
We will prove that V(f, P,[c,d]) < V(f, P*,[c,d]). First suppose that P* contains 
just one more point than P. So P* = PU {a*} and rp_1 < x* < xp for some 
natural number k where 1 < k < n. In equation (6.2), let us “isolate” the value 
| f (vp) — f(ve—1)| in the sum (6.35) for V(f, P, [c,d]) by rewriting this sum as 


V(f,P, [e,d]) = (s f(x) — Flea) | +1 Fer) — Fleer) 


Ff (xi) — f(a) (xk) — F(a") + Fle") — F(er-1)| 


+ [f(e) — Fe") + [F@*) — F@r-1)] 


\| 
—~ 
Ms 
Se Se ee 
+ 


a(S Fay fer) 
iFk 
SVGi Peo 
Therefore, V(f, P, [c, d]) < V(f, P*, [c, d]) when P* contains just one more point than 
P. If a partition P* contains 7 > 1 more points than P, then repeat the previous 
reasoning j times to arrive at the conclusion that V(f, P, |c,d]) < V(f, P*, [c, d]). 


Definition 6.3.11. Let f: [a,b] — R and let [c,d] be a closed subinterval of [a, 6). 
The function f is of bounded variation on [c,d] if there is an M € R such that 
V(f,P.[c, d]) < M for all partitions P of [c, d]. 


If a function is of bounded variation on an interval |a, b], then it will be of bounded 
variation on any closed subinterval of {a, }]. 


Lemma 6.3.12. Let f: [a,b] > R. Suppose that f is of bounded variation on |[a, }]. 
Let [c,d] be a subinterval of [a,b]. Then f is of bounded variation on [c, d]. 


Proof. Assume that f: [a,b] + R is of bounded variation on [a,b]. Let M € R be 
such that V(f,Q,|c,d]) < M for all partitions Q of [a,b]. Let [c,d] be a subinterval 
of [a, 6] and let P be a partition of [c,d]. Hence, Q = PU {a, b} is a partition of [a, }). 
It easily follows that V(f, P,[c,d]) < V(f,Q, [a, 6]) < M. Therefore, f is of bounded 


variation on |c, d]. 


Definition 6.3.13. Let f: [a,b] — R be of bounded variation on |c, d], a subinterval 
of [a,b]. The variation of f over [c,d], denoted by V(f, [c, d]), is defined by 


V(f, [c,d]) = sup{V(f, P, [c,d]) : P is a partition of [c, d]}. 
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The variation of a function measures how much its graph “moves up and down.” 
In other words, if a particle travels along the entire graph of the function, then the 
variation identifies the total vertical distance traveled by the particle. 

Let f: [a,b] — R be of bounded variation on [a, b]. Since P = {a, b} is a partition 
of [a,b] and V(f, P, [a, b]) = | f(b) — f(a@)|, from Definition 6.3.13 we see that 


|f(0) — fla)| < VCE, [a, 8). (6.36) 


Lemma 6.3.14. If f: [a,b] + R is monotone, then f is of bounded variation on 


[a, 6] and V(f, [a, ]) = [f() — F@I- 


Proof. Let f: [a,b] — R be monotone. In particular, suppose that f is increasing. 
Let P = {x0,%1,%2,...,%n} be a partition of [a,b]. Then 


n 


V(f,P, [a, b]) = S- |f(xs) — f(ai-1)| 


i=l 

= So) — f(xai-1)) as f(xi) — f(xi-1) > 0 for each i 
i=l 

= f(b) — f(a) by Exercise 9 on page 27 

= |f(b) — f(a)| as f(b) — f(a) > 0. 


Thus, V(f, P, [a, b]) = | f(b) — f(a)| for every partition P of [a,b]. We conclude that 
V(f, [a, 6]) = | f(b) — f(a)|. A similar argument applies if f is decreasing. 


We will eventually show that a function of bounded variation is Riemann integral. 
The next result is a first step in this direction. 


Lemma 6.3.15. If f: [a,b] — R is of bounded variation on |a, b], then f is bounded. 


Proof. Assume that f: [a,b] — R is of bounded variation on [a,b]. Thus, there is 
an M € R such that such that V(f, P,[a,b]) < M for all partitions P of [a,b]. Let 
x € [a,b]. Clearly, P = {a,x,b} is a partition of [a,b]. Thus, 


V(f,P, [a, b]) = |flx) — Fla) + [fF (6) — F(@)| <M. 


Thus, [f(2)| — [f(@)| < [f(e) —fla)| < M. Hence, |f(2)| < M + [f(a)| for all 
x € [a,b]. Therefore, f is bounded. O 


We now identify three algebraic closure properties which hold for functions of 
bounded variation. 


Theorem 6.3.16. Let f: [a,b] — R and g: [a,b] — R be of bounded variation on 
(a, b]. Then 

(1) kf is of bounded variation on [a,b], for any k € R. 

(2) f+ is of bounded variation on [a, }]. 

(3) fg is of bounded variation on [a, 0]. 
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Let c € (a,b). Lemma 6.3.12 shows that if f is of bounded variation on [a, 6], then 
f is also of bounded variation on the subintervals of [a,c] and [c,b]. The converse 
holds as well. Moreover, there is a simple equation that relates the total variation of 
f on these three intervals. 


Theorem 6.3.17. Let f: [a,b] + R and c € (a,b). If f is of bounded variation on 
[a,c] and on [c, 6], then f is of bounded variation on [a, b] and 


V(f, la, b]) = V(F, [a,e) + VF, [e, 4). 


Proof. Let f: [a,b] + Randc € (a, 6). Assume that f is of bounded variation on [a, c] 
and on [c, }]. Let P be a partition of [a,b]. Let P* = PU {c}. So P* is a refinement 
of P. Now let Q={x2e P*:a<chand S= {re P*:2>c}. Clearly, P* =QUS, 
Q is a partition of [a,c], and S is a partition of [c, b]. Therefore, 


V(f,P, (a, |) < V(f, P*, [a, d)) by Lemma 6.3.10 
= V(f,Q,[a,c]) + V(F,S,[e,b]) by definition of Q and S$ 
< V(f,[a, cl) +V(f, [e, 4) by Definition 6.3.13. 


So V(f, [a,c]) + V(f, [c, 6]) is an upper bound for the set 
{V(f, P, |a, b]) : P is a partition of [a, b]}. 


Thus, f is of bounded variation on [a,b] and V(f, [a,6]) < V(f, [a,¢]) + VC, [c, 4]). 
Suppose, for a contradiction, that V(f, [a,b]) < V(f,[a,c]) +V(f, [c, 6]). Then, as 


V(f,[a,6]) -—V(f lel) < VU la, el), 


there is a partition Q’ of [a,c] such that 


V(F,[a, 81) — VF; [e, 8) < V(F, Q"%, [a, e)). 


Since V(f,[a, 6]) — V(f, Q’, [a,c]) < V(f, [c, 6]), there is a partition S’ of [c,b] such 
that 
VE, [a, b}) — Mage ae la, c]) < VF, S le; b]). 


Hence, 
V(F, (a, 8]) < VF,Q", [a, el) + V(F, 5" [c,8]) =VF,Q'US", [a, )), 


which contradicts Definition 6.3.13 because Q’ U S’ is a partition of [a,b]. Therefore, 
we must have that V(f, [a,b]) = V(f,[a,¢c]) + VC, |e, 8). 


Corollary 6.3.18. If f is of bounded variation on [a,b] and a < c < d < b, then 
V(f,[a,d) < VF, [a, d)). 


Proof. Theorem 6.3.17, applied to the interval [a, d], implies that 
VF, la, el) + VF [e, d]) = VE, [a, d]). 
Since V(f, [c, d]) > 0, we see that V(f, [a,c]) < V(Jf, [a, d]). 
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Corollary 6.3.19. If f is of bounded variation on [a,b] and a < c < d < b, then 


V(f,[a,¢) — fo) < VF, [a, dl) — f(a). 
Proof. Theorem 6.3.17, applied to the interval [a, d], implies that 
V(f, le, dl) = V(F, la, d]) — VF, [a, e]). (6.37) 
By (6.36) on page 174, we have 
f(D) — flo) < IF(@) -— FQ) < VF Ie, a). (6.38) 
Thus, (6.37) and (6.38) imply that V(f, [a,c]) — f(c) < V(f, [a, d]) — f(d). 


Definition 6.3.20. Let f be of bounded variation on the interval [a,b]. Define the 
functions V: [a,b] > R and W: [a,b] — R by 


e V(x) = V(f, [a, z]) and 
e W(x) = V(f, la, 2]) — f(x) 
for all x € [a, 8). 


Corollaries 6.3.18 and 6.3.19 imply that the above functions V and W are in- 
creasing and, since f = V — W, we have the following two theorems. 


Theorem 6.3.21. If f is of bounded variation on [a,b], then f is the difference of 
two increasing functions on |a, )]. 


Theorem 6.3.22. If f is of bounded variation on [a,b], then f is integrable on [a, }]. 


Proof. This follows from Theorem 6.3.21, Theorem 6.3.8, and Corollary 6.2.6. 


Exercises 6.3 
1. Let f and g be continuous on [a,b] where iy a LG. Prove that f(c) = g(c) for 
some c € |a, }]. 
2. Let f and g be integrable on [a,b]. Suppose that g(x) > 0 for all x € [a,b]. Prove 
that if fg = 0, then ie t6 = (). 


3. Let f be continuous on [a,b] and let g be Riemann integrable on [a,b]. Suppose 
that g(a) > 0 for all x € [a,b] and [°g > 0. Prove that [? fg = f(c) fg for 
some c € |a, }]. 


4. Complete the proof of Theorem 6.3.8, by proving that the theorem holds when 
f: [a, 6] > R is decreasing. 

5. Prove Corollary 6.3.6. 

6. Suppose that f: [a,b] > R and g: [c,d] — [a,b] are both monotone. Prove that 
(fog): [c,d] > R is Riemann integrable. 


7. Let f be a function of bounded variation on [a,b]. Prove that |f| is of bounded 
variation on |a, }). 
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8. Prove part (1) of Theorem 6.3.16. 
9. Prove part (2) of Theorem 6.3.16. 
10. Prove part (3) of Theorem 6.3.16. 


11. Let f: [a,b] — R be of bounded variation on [a,b], and let V: [a,b] — R be as 
defined in Definition 6.3.20. 
(a) Prove the V is of bounded variation on [a, 0]. 
(b) Let c € [a,b]. Prove that if V is continuous at c, then f is continuous at c. 
(c) Prove that if f is continuous at c € [a,b], then V is continuous at c. 
(d) Suppose that f is continuous on [a,b]. Prove that f is the difference of two 


increasing continuous functions on [a, }]. 


12. Let f: [a,b] > R be a function of bounded variation on [a,b]. Prove that f has 
one-sided limits at each point c € [a, }] 


Exercise Notes: For Exercise 1, use the function h = f — g and Theorem 6.3.3. For 
Exercise 2, use Corollary 6.2.8 and the fact that f is bounded. For Exercise 3, read 
and modify the proof of Theorem 6.3.3, using Corollary 6.2.8. For Exercise 10, see 
the note for Exercise 15 on page 109. For Exercise 11(c), let c € (a,b). Since V is 
increasing, we see that lim V(r) < Vie) < lim, V (a) (see Exercise 14 on page 118). 
Show that these inequalities are not strict. For Exercise 12, see Exercises 12 and 13 
on page 118. 


6.4 THE FUNDAMENTAL THEOREM OF CALCULUS 


The Fundamental Theorem of Calculus, while taught early in elementary calculus 
courses, is actually a very deep result that identifies a key relationship between two 
central operations: differentiation and integration. Actually, there are two parts to 
this important theorem. The first part allows one to evaluate the definite integral of a 
function by using its antiderivative. The second part shows that continuous functions 
have antiderivatives. 


Definition 6.4.1. Let f: J > R where J is an interval. A function F: J > R is an 
antiderivative of f if F’(a) = f(x) for all x € I. 


Corollary 5.2.5 implies that any two antiderivatives of f differ by a constant on I. 
Moreover, if f: J > R is differentiable on J, then f is an antiderivative of f’. 


6.4.1. Evaluating Riemann Integrals 


The definition of the Riemann integral of a function g is rather complicated. More- 
over, in order to evaluate this integral one needs to evaluate the obscure value U(g). 
Suppose that g has an antiderivative over a closed interval. Our first fundamental 
theorem shows that one can evaluate the Riemann integral of this function simply 
by evaluating its antiderivative at the endpoints of the interval and then perform 
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a subtraction. It is remarkable that Riemann integration, a complex operation, can 
thus be reduced to antidifferentiation. 


Theorem 6.4.2 (Fundamental Theorem of Calculus I). Let f: [a,b] + R be differ- 
entiable on [a, 6]. If f’ is Riemann integrable on [a,b], then i f' = f(b) — f(a). 


Proof. Let f: [a,b] — R be differentiable on |a, b]. By Corollary 5.1.3, f is continuous 
on [a,b]. Assume that f’ is Riemann integrable on [a,b]. Thus, [° f’ = L(f’) = U(f"). 
We will show that L(f’) = U(f’) = f(b) — f(a). Let P = {x0, 71, 22,...,2n} be any 
partition of [a,b]. Since f is continuous on each subinterval [x;~1, 2;], Theorem 5.2.3 
implies that there exists a t; € (aj_1,7;) such that f’(t;) = Golatin i), that is, 


f (xs) — f(ai-1) = F's) (ei — 24-1). 


By Exercise 9 on page 27, we see that 3 (f (xi) — f(zi_1)) = f(b) — f(a). Thus, 


f(o) — f(a) = De (f (as) — f(ai-1)) = me f' (ti) («i — 4-1) = D PG )Age (6.39) 


Note that m(f’) < f’(ti) < Mi(f’) for each i = 1,...,n. Hence, 


= omit Aes oi f(t) Aa < 3 M,(f')Ax; =U(f’,P). (6.40) 


Equation (6.39) yields f(b) — f(a) = ss f'(t;)Ax;. So from (6.40) we infer that 
i=l 


L(f';P) < f() - fla) < UF, P). (6.41) 
Since P was arbitrary, we conclude that (6.41) holds for all partitions P. Thus, 
L(f') < f(®) - f(@ < Uf), 


by Definition 6.1.12. Since f’ is integrable, L(f’) = U(f’) by Definition 6.1.13. Hence, 
L(f!) = f(b) — f(a) = U(f"). Therefore, J. f’ = f(b) — f(a). 


6.4.2 Continuous Functions have Antiderivatives 


If f is Riemann integrable on the closed interval [a,b], then Theorem 6.2.9 implies 
that f is Riemann integrable on every subinterval of [a,b]. So, in particular, f is 
Riemann integrable on the subinterval [a,x] for each x € (a, b]. Thus, we can define 
a function F’: [a,b] > R by F(x) = J? f, where F(a) = 0 by Definition 6.2.12. We 
now show that the function F’ is uniformly continuous. 


Theorem 6.4.3. Let f: [a,b] — R be Riemann integrable. Define the function 
F: [a,b] > R by F(x) = J” f. Then F is uniformly continuous on [a, )]. 
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Proof. Let f: [a,b] + R be integrable. Let ¢ > 0. Since f is integrable, f is bounded 
(Definition 6.1.13). So let B > 0 be such that (a) |f(x)| < B for all x € [a,}]. Let 
6 = %. Let c,a € [a,b]. Assume that |x — c| < 6. Thus, 


|F(x) — F(c)| = i, s- f by definition of F 
= v. f+ fe f) -| f by Corollary 6.2.13 
= ig f | by algebra 
<Blx—-—c¢| by (4) and Lemma 6.1.16 
< Bo=e as |x —c| <d andé=—. 


B 


Therefore, |F'(x) — F'(c)| < ¢ and F is uniformly continuous on [a, 6). 


Let f be continuous on [a,b]. Our second fundamental theorem implies that the 
function F’ defined in Lemma 6.4.3 is an antiderivative of f (see Theorem 6.4.6). 


Theorem 6.4.4 (Fundamental Theorem of Calculus II). Suppose that f: [a,b] > R 
is Riemann integrable. Define the function F': [a,b] + R by F(x) = Jf” f. If f is 
continuous at c € [a,b], then F is differentiable at c and F’(c) = f(c). 


Proof. Let f: [a,b] > R be integrable, and continuous at c € [a,b]. We prove that 


im F@)- PO 


rc L—C 


= f(¢). 
To do this, let ¢ > 0. Since f is continuous at c, there is a 6 > 0 such that 
lft) — fl] < : when |t — c| <6 and t € [a, 8]. (6.42) 


Let x € [a,b] and assume that 0 < |x — c| < 6. Thus, 


_ = ([ f) -~ f(o) by Cor. 6.2.13 
= fe ae by Def. 6.1.13 
oe ee — | f(t) dt aoa | fe a by Cor. 6.3.2 
“Ta fae f(t)dt— ine a iyaleebee 
= aq|fvo-sore by Thin. 6.2.5, 
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Therefore, 
AO ~ 10) = 5 | [fu - onal s Gs le-d<e 


by (6.42) and Lemma 6.1.16. Thus, by Definition 5.1.1, F’(c) = f(c). 


6.4.3. Techniques of Antidifferentiation 


Let f: J > R where J is an interval. Recall that a function F’: I > R is said to be an 
antiderivative of f if F’(a) = f(x) for all x € I. Before we continue, we will slightly 
extend the notion of being Riemann integrable. 


Definition 6.4.5. Let f: J — R where J is an interval. We say that f is integrable 
over J if f is Riemann integrable on [a,b] for every [a,b] C J. 


If f: I > R is integrable over the interval J, then for each a € J we can define 
the function F: I + R by F(a) = J” f. Our next theorem shows that if f: I > R is 
continuous, then F is an antiderivative of f. 


Theorem 6.4.6. Let f: J — R be continuous on the interval J. Let F': I + R be 
defined by F(x) = J” f, where a € J. Then F is differentiable on IJ and F’(x) = f(x) 
for all x € I. 


Proof. Let f: I + R be continuous on J. By Theorem 6.3.1, f is integrable over I. 
Let a € I and define the function F: I > R by F(x) = f* f. Let e € J and be I be 
such that b 4 a and zx is between a and b. If a < x < b, then Theorem 6.4.4 implies 
that F(a) = f(a). Ifb < x < a, then Corollary 6.2.13 implies that i =F f+ye re 
Thus, fj f = F(x) +f; f and so, F(x) = fq f Je f. Hence, F'(x) = —(Jp f)! = F(a) 
by Exercise 1. 


So, by Theorem 6.4.6, if f: J + R is continuous, then {* f is an antiderivative 
of f whenever a € J. Evaluating the antiderivative f” f is often considerably more 
difficult than evaluating the derivative of f. Methods for evaluating antiderivatives, 
called techniques of integration, are covered in a calculus course. In this section we 
will establish three such techniques of integration. 

Now suppose that f: J — R is integrable over I but f is not continuous on J. Is 
the function [” f an antiderivative of f? The following result shows that if f does 
have an antiderivative, then {” f is also an antiderivative of f. On the other hand, 
there are integrable functions that do not have an antiderivative (see Exercise 4). 


Lemma 6.4.7. Let f: J > R be integrable over the interval J. Let a € J. If Gis an 
antiderivative of f, then for all « € I 


[ f=6@)-c@, 


and f* f is an antiderivative of f. 
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Proof. Let f: I + R be integrable over J and let G be an antiderivative of f. Let 
a € I. By Theorem 6.4.2, [” f = G(a«) — G(a) for all x in I. As (G(x) —G(a))’ = f(z) 


for each x € I, we conclude that {” f is an antiderivative of f. 


Our first technique of integration is often used to transform the antiderivative of 
a product of functions into an antiderivative that is easier to evaluate. 


Theorem 6.4.8 (Integration by Parts). Let f and g be differentiable functions on 
an interval J such that f’ and g/ are both integrable over J. Then fg’ and f’g are 
integrable over J and for any a € I, 


[ f= f@)g(0) - Haga) — [ fg, forall x € 1, (6.43) 


Proof. Let f and g be differentiable on J such that f’ and g’ are integrable over I. 
Corollaries 5.1.3, 6.2.16, and Theorem 6.3.1 imply that fg’ and f’g are integrable 
over I. By Theorem 5.1.4(3) (the product rule), we conclude that fg’ = (fg)’ — gf’. 
Let a € J and x € J. Definition 6.2.12, Corollary 6.2.6, and Theorem 6.4.2 imply that 


[tt = [a0 - [ fo= sea) - fase) - [Fo 


If one can evaluate the integral of f’g on the right in (6.43), then one can evaluate 
the integral of fg’ on the left. This technique is applied when the integral of f’g is 
easier to evaluate than that of fg’. 

We now present another useful method for finding antiderivatives. This technique 
involves reversing the chain rule. 


Theorem 6.4.9 (Integration by Substitution). Let g: I + J be differentiable on I 
and let f: J + R be continuous on J, where J and J are intervals. If g’ is integrable 
over I, then (f 0 g)g’ is integrable over I and for any a € J, 


x g(x) 
/ (foie = i f tora. (6.44) 
: (a) 


gla 


In particular, [ Ha f is an antiderivative of (f 0 g)q’. 


Proof. Let g: I + J be differentiable on J, f: J > R be continuous on J, and g’ be 
integrable over J, where J and J are intervals. By Theorem 6.2.14, Corollaries 6.2.16, 
5.1.3, and Theorem 6.3.1, (f o g)g’ is integrable over J. Let a € I. Define G: 1 > R 
by Guyaf a f. The function G is an antiderivative of (f o g)g' by Exercise 6. 


Since G(a) = 0, Lemma 6.4.7 implies that [’(f og)! = ce f for all x in J. 


If one can evaluate the integral of f on the right in (6.44), then one can evaluate 
an antiderivative of (fo g)g’ on the left. In other words, Theorem 6.4.9 allows one to 
transform certain antiderivatives, which are difficult to evaluate, into ones that are 
easier to evaluate. 
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You may recall from calculus that in order to evaluate antiderivatives of certain 
functions that involve the square root of a sum, or difference, of squares, one would 
use integration by trigonometric substitution, which is an application of a general 
technique of integration called inverse substitution. Integration by inverse substitution 
essentially inverts the antidifferentiation equation (6.44), in Theorem 6.4.9. This is 
reason why the method is called “inverse substitution.” This method is tacitly applied 
in calculus and real analysis books, but it is rarely stated and almost never verified. 
Recall that J* denotes the set of points in an interval J that are not endpoints of J. 


Theorem 6.4.10 (Integration by Inverse Substitution). Suppose that f: J > R is 
continuous, g: I > J is differentiable, J = g[J], and g’(u) 4 0 for all u € I*, where 
I and J are intervals. Then g~!: J — I exists and is continuous. Moreover, if g' is 
Riemann integrable over J, then for any a € J, 


ar 
[i =f Peay pelleew. (6.45) 
1c‘ 


Hence, f[ La fog)g’ is an antiderivative of f. 


Proof. Let f: J — R be continuous, g: 1 — J be differentiable, J = g[{J], and 
g'(u) 4 0 for all u € I*. Corollaries 5.1.3 and 5.2.21 imply that the inverse function 
g-!: J > I exists and is continuous. Given that g’ is Riemann integrable over J, 
Theorem 6.4.9 implies that (fo g)g’ is Riemann integrable over J and for any a € J, 


g *(2) g(g7* (x) 
| (Food =f ‘f= fF, forateed. 
g(a) (g-*(a)) 


Thus, (6.45) holds, and a fog)g is an antiderivative of f by Theorem 6.4.6. 


Theorem 6.4.10 is implicitly applied in calculus books to evaluate an antiderivative 
of a difficult function f. This is done by introducing an appropriate invertible function 
g such that one can evaluate an antiderivative of (fog)g’. Then, using equation (6.45), 
one can evaluate an antiderivative of f. The next two examples illustrate this method. 


Example 6.4.11. Let f: [—2,2] > R be defined by f(x) = V4 — 2?. Evaluate [*, f 
using Theorem 6.4.10. 


Solution. Clearly, f is continuous. Consider the right triangle in Figure 6.4. Thus, 


xz = 2sin(@). Let g: [-5, 5] — [-2,2] be defined by g(@) = 2sin(@). The function 


7) 
V4—x? 


Figure 6.4: Trigonometric substitution triangle. 
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g is differentiable and has range [—2, 2] which is also the domain of the function f. 
Moreover, g'(@) # 0 for all 0 in (—5, 5). (Observe that g'(—5) = 0 and g/() = 0.) 
Thus, g~': [—2, 2] + [—%, $] exists and is continuous. Now, using the substitution 


x = g(0), where @ € [—5, 5], we obtain 


(f 0 g)(0) = f(g(0)) = 1/4 — 4sin?(@) = 2cos(0), and g/(@) = 2cos(6). 
Thus, f(9())g9’(0) = 4cos?(@). By Theorem 6.4.10, we have that for all x € [—2, 2], 
~*(a) g*(2) 
[f =i (fog)g =F 4 cos*(6) dé. (6.46) 
g-\(-2) g~*(—2) 


As 4cos?(9) has antiderivative 2(sin(0) cos(@) + @) and g~'(—2) = —3, we conclude 
from Theorem 6.4.2 that the equation 


‘(a) 
14 =f 4 cos”(0) dO = 2(sin(g~'(x)) cos(g~*(x)) + g-‘(a)) —a (6.47) 


holds for all x € [—2,2]. Since g(@) = 2sin(@), by solving the equation x = 2sin(@) 
for 6, we can obtain a formula for g~!(x). Then one can replace g~'(x) in (6.47) 
with this formula and obtain a explicit solution; however, there is another equivalent 
method. Since x = g(9) implies that g~!(a) = 0, we can use Figure 6.4 to infer that 


V4 — 2? 
2 
7) and (6.48), we obtain 


x \/4 — x2 
/ f=2 (5 ; “4+sin7 (3)] —f, for all x € [—2, 2]. 
=3 


Example 6.4.12. Let f: (—3, $)  R be defined by f(x) = eee Evaluate 
Jo f using Theorem 6.4.10. 


sin(g ‘(x)) = =, cos(g~*(x)) = , and g"'(x) =sin7’ (5) ; (6.48) 


HR NIS 


Therefore, using (6. 


Solution. The function f is continuous. Now let g: (—1 


,1) > (—§, 5) be defined by 
Tas = sin”!(t), the inverse sine function. The fnceon @ is differentiable, has range 
(—§, 5), and g/(t) #0 for all t in (—1,1). Thus, g“1: (-3, 5) — (—1,1) exists and 
is continuous. From calculus, we have the identities 


cos(sin7*(t)) = V1 — #? 


d tice —.— a 
rm —(sin-“(t)] = a 


Using the substitution x = g(t) and the above two identities, we obtain 


t+1 iia 1 
M90) = Gy rae nd 90 = ae 


Hence, f(g(t))g'(t) = aa which has antiderivative 4. Since g~!(x) = sin(x) and 
g 1(0) =0, Theorem 6.4.10 and Theorem 6.4.2 imply that 


x sin(x) 1 os 1 i 
[ f= (1—t)? ~~ 1—sin(z) 
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6.4.4 Improper Integrals 


The definition of the Riemann integral is based on a bounded function that is defined 
on a closed and bounded interval. Under these conditions, if the integral exists, then 
the integral is said to be proper. There are cases, however, in which these conditions 
can be relaxed by relying on the limit concept. 


Unbounded Intervals 


In the definition of Riemann integral [ ig f, we assume that the domain of f has the 
form |a,b]. However, in many applications in physics, economics, and probability, 
one must integrate over unbounded intervals. We must therefore give meaning to 


operations like the following: 
/ fand / fi (6.49) 


Any integral having one of the forms in (6.49) is called an improper integral. 


Definition 6.4.13. Let c be a real number. Given a function f: [c,oo) — R such 
that f Riemann integrable on |c, ¢] for every £ > c, we define 


[oreim fit () 


For any function g: (—co,c] > R such that g Riemann integrable on [¢,c] for every 
 <c, we define 


i g= lim g. (II) 


l->—0o Je 
In (I) and (11), if the limit on the right exists and is finite, then the corresponding 


improper integral converges. Otherwise, the improper integral diverges. 


Example 6.4.14. Let f: [0,00) + R be defined by f(x) = e~*. Since F(x) = —e~* 
is an antiderivative of f, we have 


loo ysl 


if = tim f Pim =FOy 0s Si. 
0 
Thus, the improper integral 5° f converges. 


Unbounded Integrands 


In the definition of Riemann Integral [ ie f, we assumed that the integrand f was 
defined for all x in the interval [a,b]. However, in some cases of interest, we may 
have that f(a) or f(b) are undefined. In either of these cases, we also call f[ iH f an 
improper integral. We can give meaning to such integrals. 


Definition 6.4.15. Given a function f and an finite interval [a, b], if f(b) is undefined 
and f Riemann integrable on |a, 4] whenever a < ¢ < b, we define 


[s-pm ft (i) 
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If f(a) is undefined and f Riemann integrable on |¢, b] whenever a < ¢ < b, we define 


b b 
_ jj / e 
vi PL Pee (ii) 
In (i) and (ii), if the limit on the right exists and is finite, then corresponding improper 


integral converges. Otherwise, the improper integral is said to a 


Example 6.4.16. Let f: [0,1) + R be defined by f(z) = 7. aa f(1) is 
undefined. Since F(a) = —In(1 — 2) is an antiderivative of f, we sie 


[i=am [t= me (£) — F(0)) =o. 


l1 l>1- 


Thus, the improper integral iF f diverges. 


Exercises 6.4 

*1. Let f be continuous on [a, }]. Let G: [a,b] + R be defined by G(x) = i. f. Prove 
that G’(x) = —f(z). 

2. Let g: [0,1] — R be defined by 


1, ife<1: 


W= 45 ife=1. 


The function g is Riemann integrable (see Example 6.1.20). Show that fj g = 
for all c € (0, 1]. 


3. Let f be continuous on [0,1]. Suppose that fj f = x for all x € [0,1]. Prove that 
f(x) = 1 for all x € [0,1]. Using Exercise 2, show that this conclusion may not 
hold if f is not continuous. 


*4. The function g: [0, 1] > R in Exercise 2 is Riemann integrable. Show that g does 
not have an antiderivative. (See Theorem 5.2.17.) 


5. Let f: [a,b] > R be continuous and let g: [a,b] + R be differentiable on |[a, }]. 
Define H: [a,b] > R by H(x) = f’ g(x) f(t) dt. Find H'(z). 


*6. Let f be continuous on an interval J. Suppose that g: I > J is differentiable on 
the interval J. Let a € J. Define the function G: I > R be 


C3) 
ow) = fF 
Prove that G is differentiable and that G’(x) = f(g(x))g'(«) for all x € J. 


7. Let f be continuous on an interval J and h: I > J be differentiable on the 
interval J. Let 6 € J. Define the function k: J > R by 


ka) = f . f 


Prove that k is differentiable and that k’(x) = —f(h(a))h'(x) for all x € J. 
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8. 


10. 


11. 


12. 
13. 
14. 
15. 


16. 
17. 
18. 
19. 


20. 
21. 


Let f be continuous on an interval J. Let g: [> Jandh: I > J be differentiable 
on the interval J. Define k: J > R by 


Prove that k’ exists and that k’(a) = f(g(x))g'(x) — f(h(x))h'(x) for each x € I. 


. Let f be continuous on [a,b] such that f(x) > 0 for all x € [a,b]. Define 


F: [a,b] > R by F(x) = J f. Prove that F is strictly increasing on [a, }]. 


Let f: [0,00) > R be continuous with f(x) > 0 for all x € [0, co). Suppose that 
(f(x))? = 2 Jo f for all x € [0, 00). Show that f(x) = x for all x € [0, 00). 


Let f: [a,b] + R be continuous such that f” f = lia for all x in [a,b]. Show 
that f(x) = 0 for all x in [a, 6). 


Use Theorem 6.4.8 to evaluate {” fg’ when f(x)g'(x) = xsin(z). 
ae and g(2) _ Jt. 
and g(x) = 2”. 


Use Theorem 6.4.9 to evaluate f7'(f0og)g’ when f(x) = 


Use Theorem 6.4.10 to evaluate f7’ f when f(x) = Tn 


Let f: [a,b] + R be Riemann integrable. Show that jim ie — i cit 
=e 

Evaluate [5° ve~" da. 

Show that f7° + dx converges for p > 1 and diverges for p < 1. 

For what values of p < 0 does if x? dx converge? 


Assume that [°,, f and J°° f converge for any real number c. Show that the sum 
Sooo f + JO f is independent of the choice of c. 


Evaluate ho’ ak dx and f° 


AS 
Define f: (0,00) + R by f(x) = 2 and L: (0,00) > R by L(x) = ff f. Let 
a € R* and bE R*. Validate, in order, the following properties of L. 


(a) L is continuous on (0,00). 


) 
) 
) 
(e) L(ab) = La 1) + L(b). 
(fy te?) = ae ), for all integers n > 0. 
(g) (L(4))' = —+ for all x € (0,00). Thus, L(4) = —L(x)+C for a constant C. 
(h) L(%) = L(a) — 10). 
(i) L(a”) = nL(a), for all integers n < 0. 
(j) Let y € R. Let m,n € Z be such that m < La <n soda”) <4" <2"): 
) 


(k) ran(L) = R, and there is a unique real number e > 0 such that L(e) = 1. 
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22. Let L: (0,00) > R be as defined in Exercise 21. As L is strictly increasing, the 
inverse function L~!: R — (0,00) exists. Let E(x) = L~+(2) for all x € R. Thus, 
E(x) = y if and only if L(y) = @ for all real numbers x and y. Let c € R and 
d € R. Validate the following: 


(a) E is differentiable and E’(x) = E(x), for all x € R. 
(b) E£ is strictly increasing on R. 
(c) E(c)E(d) = E(c+ d). 
(d) Fh = Blc—d). 
(e) E(nc) = (E(c))”, for all integers n. 
23. Let L: (0,00) + R and e be as defined in Exercise 21. 
(a) Show that lim LO+2) == ly (c) Show that lim (1+ 4)P=e. 
ia n> Co 
(b) Show that lim L((1+5)") =1. 


o) 


Exercise Notes: For Exercise 1, Corollary 6.2.13 implies that cs = feft is bi 
For Exercise 2, one can use Theorem 6.4.3 to show that if g = 1. For Exercise 5, 
see Theorem 6.2.5(a). For Exercise 6, use the chain rule and Theorem 6.4.6. For Ex- 
ercise 7, use Definition 6.2.12 and Exercise 6. For Exercise 8, use Corollary 6.2.18, 
Exercise 6, and Exercise 7. For Exercise 9, apply Theorem 5.2.8. Exercises 13 and 14 


offer two methods for finding an antiderivative of ——~. Exercise 21 presents a 


(it2)2/z' 

formal definition of the natural logarithm function L(x), which is often denoted by 
In(a). Exercise 22 presents a formal definition of the natural exponential function 
E(«), which is usually denoted by e”. For Exercise 23(b), use part (a) and Theo- 


rem 4.3.4. For Exercise 23(c), see Theorems 3.4.8 and 4.2.1. 
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Infinite Series 


Around 1665, Newton discovered that any binomial of the form (1+ 2)", where r ¢ N 
and |x| < 1, can be expressed as an infinite series. In 1673, Leibniz discovered that 
m can be written as an infinite series. For Newton and Leibniz, infinite series were 
of fundamental importance in their development of calculus; however, during these 
times, questions of rigor and convergence were of secondary importance. 

One typically first studies infinite series, in a standard calculus course, without 
going deeply into the theory of infinite series. In this chapter, using the results in 
Chapter 3 on the convergence of sequences, we will prove the important theorems 
that concern an infinite series of real numbers. In Section 7.1, we introduce the concept 
of a convergent infinite series, and discuss a variety of different kinds of infinite series. 
We then examine in Section 7.2 a range of tests for convergence. Finally, in Section 7.3 
we investigate the effect of regrouping and rearranging the terms of an infinite series. 


7.1. CONVERGENCE AND DIVERGENCE 


As we have seen, the completeness axiom and the limit operation are critical compo- 
nents in real analysis. In particular, these concepts are used to develop the derivative 
and the Riemann integral. The limit concept was also used in our study of sequences, 
and it will again be used in our examination of infinite series. 


Definition 7.1.1 (Infinite Series). Given a sequence (a,,) of real numbers, the nth 
partial sum s,, is defined by 


n 
Sn = > Op = a, +2 +43 +--+ + ap. 
k=1 


[o.e) 

We refer to the sequence (s,,) of partial sums as the infinite series 5> a;. The real 
k=1 

numbers aj, @2,a3,... are called the terms of the infinite series. 


The notion of convergence or divergence of an infinite series is defined in terms 
of the convergence or divergence of the sequence of its partial sums. 


n 
Definition 7.1.2. An infinite series }~ a, converges if the sequence of partial sums 
k=1 


189 
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n 


[oe 
(Sn) converges to a real number L, and we write >> a, = lim s, = lim >> a, = L. 


[oe) 
We refer to L as the sum of the series )> ax. If the sequence (s,,) does not converge, 


[ee) 
then the series }> a,x is said to diverge. 
k=1 


CO 
It will often be convenient to write }> a, = a, + a2 + a3 +---. However, the 
k=1 


notation 3 ay does not denote an “infinite sum.” The notation = az represents the 
k=1 k= 
sequence (s,,) of partial sums and it also represents the limit of the sequence (Sn) 


of these partial sums, if the limit exists. The context in which the notation s dr is 
k=1 
being used should make its meaning clear. 


[oe 
Example. The nth term of the sequence of partial sums (s,,) of the series >> aa is 
k=1 


fie i 


=i] ‘ 
3 Ds 2h 


al el 


by the Shift Rule 1.4.8 and Theorem 1.4.7. Since lim Sn = 1, we see that >> = =. 


k=1 
Example. The sequence of partial sums (s,,) of the series )> (—1)**1 is given by 
k=1 
Ss, = T=] 
$2 = 1-—1=0 


s3=1-1+1=1 
s4=1—-1+1-1=0 


Thus, we see that the sequence (s,,) does not converge. Thus, the series diverges. 


As illustrated in the above two examples, the convergence of an infinite series 
depends on the convergence of the sequence of its partial sums. In many cases, we 
will not be able to evaluate the limit of a convergent infinite series; but, we will be 
interested in the question: Does the series converge or diverge? It is this question 
that makes the theory of infinite series most interesting. 


Theorem 7.1.3. Let > ax be a series such that az > 0 for all k. The series = ak 


converges if and only if “thie sequence (s,,) of partial sums is bounded above. 


Proof. Let » ay, be a series with nonnegative terms. For each n €N, let s, be the 
=1 
nth ere sum of this series. It follows that (s,) is an increasing sequence. 
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(=). Assume that > a, converges and let > a, = L for some L € R. Since (s,) is 
increasing, Lemma 3. ra 2 implies that s, < L oi all n. So (s,) is bounded above. 


(<=). Suppose that (s,,) is bounded above. As (s,,) is increasing, Theorem 3.4.6 implies 


that (s,) converges, that is, }> a, converges. 
k=1 


Remark 7.1.4. It is possible for an infinite series to start at 0 or a natural number 

m > 1. Any such series can be “reindexed” to an equal series with a new starting 
lee) 

integer n. Consider the series 5> ay with starting value m. To rewrite this series as 


k=m 
one with the new starting value n, first compute 


s = old start — new start = m—n. 


Then S> ap = SS agys. For example, & B= » DF and » b, = & brat. 


k=m k=n = 


Definition 7.1.5. A series 5° by is a telescoping series, if there is a sequence (a) 
k=1 
such that one of the following holds: 


(i) Each term b, can be written as by = ag — p41. 


(ii) Each term b, can be written as by = an41 — Gr. 


1 1 


lee) 
é | ree! Gees i lie 3° 2 f : 
Example. Since gz = % — aT: the series p> ap 18 a telescoping series. 


The partial sums of a telescoping series can be easily simplified by cancellation 
and thereby, there is a simple way to determine it’s sum, if it converges. 


Theorem 7.1.6 (Telescoping Series). Suppose that 5° by is a telescoping series. 
k=1 


1. If by, = ap—apxy, for allk € Nand lim a,4,1 exists, then S> by = ai-( lim Gn41)): 


2. If by = ax41—ax for all & € N and lim Gn+1 exists, then > b, = ( lim Qn41)— a4. 


k=1 NCO 
Proof. We prove item 1. The proof of 2 is similar. Suppose that each term b, can be 
written as by = ap — Gp41 and lim An41 exists. The sequence of partial sums (s,,) is 
nr CO 
given by (see Exercise 9 on page 27) 


n n 


Sn = >) be = > (01 — Ops) = 1 — Ong. 


k=1 k=1 


CO 
Thus, S> by = lim sy, = a, — (lim @n41): 
k=1 noo noo 


CO 
7 1 ‘ : . 1 eae!) 1 an 
Example. The series » Pag i8a telescoping series as tk — &~ (ett) — k— Cht1- 


Since a, = 1 and Jim ne = ii eat =; Py Pak = 1 by Theorem 7.1.6(1). 


N—- Oo 
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CO 
Let S> by be a telescoping series as in the statement of Theorem 7.1.6. If lim An41 
k=1 noo 


loc) 
does not exist, then >> by diverges. 
k=1 
Since an infinite series converges if and only if the sequence of its partial sums 


converges, Definition 7.1.2, Theorem 3.2.2, and Exercise 12 on page 67 easily imply 
the following theorem. 


Theorem 7.1.7 (Linearity). If $> a, and >> by converge and c is a constant, then 
k=1 k=1 


(1) So (ae t+ be) = DS ag + YF dp; 
hel | RE 


(2). 32 bap 6 ae 
k=1 k=1 


As alluded to earlier, the theory of infinite series is mainly concerned with the 
convergence or divergence of a series. We now show that a series cannot converge if 
its terms do not approach 0. 


Theorem 7.1.8. If >> ag converges, then jim ap = 0. 
k=1 oo 


CO 

Proof. Let s, be the nth partial sum of the convergent series }> az. Let lim s, = L. 
k=1 noo 

Note that an = Sn — Sn—1 for each n > 2. Thus, 


lim a, = lim s, — lim s,_; = L-L=0. 
noo noo noo 


The contrapositive of Theorem 7.1.8 is identified in our next theorem 


Theorem 7.1.9 (Divergence Test). If (a;) does not converge to 0, then the series 


>> ax diverges. 
k=1 


k _ . . 
Ea = 1, Theorem 7.1.9 implies 


[oe 
For example, consider the series » Fat: As jim, : 


[oe 
that >> fon diverges. The converse of Theorem 7.1.8 is false; that is, the condition 
k=1 


[oe 
jim ay, = 0 does not imply that >> az, converges. This is verified by out next result. 
oo k=1 


Theorem 7.1.10 (Harmonic Series). The harmonic series > i diverges. 
k=1 


Proof. Consider the nth partial sum: 
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Imagine n to be very large and write 


ee Be house + 
eran ay oes ae ms 
1 (3 ') (: i | ) (3 ‘) 1 
ft i es Basia Eee vef 
D a4 a) ey ae 9 16 n 
ail (| ') (: Dd ee ty dil 
Oo NAc tA 8's 8 8/' \16— fier fo a n 
S448 Posi ide etd foe 
- OO eo n 


By taking n sufficiently large we can introduce as many 3's in the above sum as we 
wish. In fact, one can prove by induction that sgn >1+n- 5 for all n € N. Thus, the 
sequence of partial sums (s,,) is not bounded above. Hence, Theorem 7.1.3 implies 
that the harmonic series diverges. 


A geometric series is any series that can be put into the form 


CO 
Scart =atartar?+---+ark+---, 
k=0 


where a 4 0 and r are constants. Geometric series have applications in many areas 


outside of mathematics; for example, physics, engineering, biology, economics, and 
computer science. 


[o,e) 
Theorem 7.1.11 (Geometric Series). Let a be nonzero. The geometric series $> ar* 
k=0 
CO 
converges if |r| < 1 and diverges if |r| > 1. Moreover, if |r| <1, then > ar* = ~*. 
k=0 


Proof. Let a # 0. If |r| > 1, then the sequence (ar*) does not converge to 0. Thus, 
by Theorem 7.1.9, the series S> ar* diverges. Suppose that |r| <1. For each n > 0, 
k=0 


rrti_y 


the nth partial sum is (A) sp, = es, ar*§ =a —;—, by Theorem 1.4.7. Since |r| < 1, 


lim rt! — 0 by Corollary 3.1.12. Thus, lim Sn = 7% by (a). So YF ar® = = 


[o.e) 
Any series of the form >> ar* where m > 0 is also a geometric series. However, 


k=m 
the formula in Theorem 7.1.11 for the sum of a geometric series is valid only when 
the index starts at 0. Nevertheless, using Remark 7.1.4, one can rewrite the series as 
one with a starting value 0. For example, 


oo Pe Le AD 1\*¥ 2 5 
y5(5) => 5(5) =y 7G) —1-d 18 


k=3 k=0 


Recall that a sequence converges if and only if it is a Cauchy sequence. This 
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lee) 
leads to our next theorem. Let 5° a, be an infinite series and, for each n € N, let 
k=1 


n 
Sn = >> ax be the nth partial sum. Whenever n > m > 1, we have that 
k=1 


n 
Sn — 8m = Om41 F1* + an = os ak. 
k=m+1 


Thus, Theorem 3.6.5 implies our next result. 
Theorem 7.1.12 (Cauchy Series Criterion). An infinite series }> a, converges if 


n=1 
and only if for each ¢ > 0, there is an N such that for alln >m>N, 


n 


S- ar| = |Qm41 +--+ + an| <e. 
k=m+1 


CO 
Given a series 5> ay, = a, +a2+a3+--- and an integer j > 0, consider the series 
k=1 


[oe) 
D2 Aj+e = j41 + Gj42+4;43 +--+ which is obtained by eliminating the first 7 terms 
k=1 


lee) [oe) [eoe) 
of the series }) az. The series >) aj+x is called a tail of the series >? ay. Moreover, 


k=1 =1 k=1 
for n > 1, the nth partial sum t, of this tail series is 
tn = D_ dj4e = A541 + O42 + Oj43 + ++> + aj4n. 
k=1 
Example. For the series }> 7 =1+ | + ‘ + a + 3 +--+, let 7 = 3. Then the tail 
k=1 

series is 

se 1 asc Jl 4: IY xtc 1 i: 

9 —* T T 

ome (k + 3) 16 25 36 49 64 

with the third partial sum t3 = + + = ++ aie 


Theorem 7.1.13 (Tail Convergence Test). Consider the series }> ay, and let 7 > 1 
k=l 


[oe) [ee) 
be an integer. Then >?) az converges if and only if >) aj, converges. 
k=1 k=1 


Proof. Let j > 1 and let s, = >7 az and t, = >) aj+x, for each n € N. Thus, for all 
k=1 k=1 


J 
n EN, we have the identity (A) sj4n = (x «| + tn. Therefore, 


[ee) 

S- dz converges iff (s,) converges by Definition 7.1.1 

k=1 iff (s;4,) converges by Corollary 3.3.11 
iff (t,) converges by (a) 


[e,e) 
iff S- aj+z converges — by Definition 7.1.1. 
k=1 
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Exercises 7.1 


1. 
2. 


3. 


10. 


11. 


12. 


13. 


Prove Theorem 7.1.7. 


Prove that if 5° (ag, + bg) and >> by, converge, then >> ax converges. 
k=1 k=1 k=1 


Let >> ay, be a convergent series. For each k € N, let 
k=1 
J ak, if a, > 0; Jae, if a, < 0; 
PP Ne. Aba ONO. see 0: 


(a) Show that ay = pe + for all k EN. 


(b) Prove that 5° pz, converges if and only if >> 7, converges. 
k=1 k=1 


. Prove Theorem 7.1.12. 
. Show that s 
n=1 


diverges. 


a ae 
Vnt+1+/n 


. Show that 3 t Ge diverges. 


k=0 


. Show that 3 3 eu converges and find its sum. 


k=0 


. Given that 


1 1 
34+V34+14+—54+54+--: 
V3 3 


is a geometric series, find its sum. [Hint: a = 3 and ar = V3.] 


[o.e) 
. Show that >> KET) converges and evaluate its sum. 
k=1 


Let (x,) be a sequence of nonnegative real numbers, and let yy, = &p — Ln41 for 
each n EN. 


CO 
(a) Prove that >> y, converges if and only if the sequence (x,,) converges. 
n=1 


(b) If 5° yp converges, then what is the sum? 


n=1 


Let a £0 and |r| < 1. Find the sum of ¥ ar?*, 
k=0 


Let a 40 and |r| < 1. Find the sum of §¢ ar?**1, 
k=0 


[oe) 
Suppose that >> |a,| converges and that (b,) is a bounded sequence. Using 
n=1 


Theorem 7.1.12, prove that >> anb, converges. 
n=. 
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7.2 CONVERGENCE TESTS 


[oe) 
Given any infinite series > ag, there are two significant questions: 
k=1 


[oe) 
1. Does >> ax converge? 
k=1 


co 
2. If S> ay, does converge, can we evaluate its sum? 
k=1 


In Section 7.1, we presented methods for evaluating the sum of convergent geometric 
series and telescoping series. In general, however, it is difficult to determine the exact 
sum of a convergent series. Thus, in this section, our focus will be on establishing 
several tests that may be used to determine whether or not a series converges. 


7.2.1 Comparison Tests 


If you know that a given series converges (diverges) and you want to know if another 
series converges (diverges), then you may be able to use a comparison test to resolve 
your inquiry. Our first comparison test follows. 


Theorem 7.2.1 (Direct Comparison Test). Let > a, and > b;, be series such that 
O < ax < by for all k > N, for some integer N > 0. 
(1) If 5° 6, converges, then >> az converges. 

k=1 k=1 


(2) If >> ax diverges, then >> b; diverges. (This is the contrapositive of (1).) 
k=1 k=1 


Proof. Let >* az and >> by be such that (A) 0 < ag < by for all k > N > O. For 
k=1 k=1 
each n € N, let sp = >> aniz and th = u by+kz. Assume that >> bg converges. 
k=1 k=1 
Theorem 7.1.13 implies that the tail series = bv+r converges. So, by Theorem 7.1.3, 


there is a real number B such that t, < B oe all n € N. From (A), it follows that 
Sn < ty < B, for all n € N. So (s,) is bounded above. Theorem 7.1.3 implies that 


>> an+ez converges. Hence, >> ax converges by Theorem 7.1.13. 
k=1 k=1 

Given a series with nonnegative terms, to apply Theorem 7.2.1, one should find a 
convergent (divergent) series whose terms are nonnegative and are eventually larger 
(smaller) than the corresponding terms of the given series. To find this relevant 
series, it often requires using properties of inequality. For example, consider the series 


Ds Ze On can verify that oe > 0 for all k > 1. Observe that for k > 5, we 


a that Baits 2 ss a2 es SOE = & = t: By Theorems 7.1.10 and 7.2.1, we 


conclude that 2 ae at 2. diverges. 
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Our next comparison test is often easier to apply than the direct comparison test, 
because it removes the need to verify that the required inequality holds between the 
terms of the two sequences. In our next theorem, we write 7 < oo to mean the x is 
a real number. 


Theorem 7.2.2 (Limit Comparison Test). Let S> ay and > by be such that a, > 0 
k=1 k=1 
and b; > 0 for all k EN. 


(1) If0< jim i < oo, then >> ax converges if and only if >> b, converges. 
00 k=1 k=1 


[e.e) [oe) 
(2) If lim #* =0 and >> by converges, then >? a, converges. 
k-r00 k=1 k=1 


CO CO 
(3) If lim #* = co and ¥? ay converges, then >) by converges. 
k—s00 k=1 k=1 


Proof. Let > ay, and bs by be series where ay > 0 and by > O for all k € N. To prove 


item (1), let ce = jm 5, “a be a positive real number. Since 0 < L and ¢ > 0 for all 


k EN, Theorem 3.1.24 ae Lemma 3.2.5 imply that there are positive os numbers 
M and N such that M < Be < N for all k € N. Since b, > 0 for each k © N, it 
follows that 


Mbp < ay < Nby for all k. (7.1) 
If 5 by converges, then s Nb, converges by Theorem 7.1.7. Thus, (7.1) implies 
k=1 k=1 
that S a, converges by Theorem 7.2.1, the Direct Comparison Test. 
k=1 


If >> ax converges, then (7.1) implies that u Mb, converges by Theorem 7.2.1. 
k=1 


Since M 4 0, Theorem 7.1.7(2) implies that > by converges. 


The proofs of items (2) and (3) are left io Hens 1 and 2, respectively. 


Given a series, to apply the Limit Comparison Test, one needs to find a relevant 
companion series to compare with the given series. If a sum or difference of powers 
of k appear in the numerator or denominator of the given series, delete all but the 
highest power of & in numerator and in denominator. For ee consider once 


[oe 
As instructed, we obtain the series we: p=b i: One 
k=1 k=1 


aon 
_3k+2_ 


can verify that jim #=e+5 — 3. Thus, by Theorems 7.1.10 and 7.2.2(1), the series 
—0o 


k 


[oe) 
again the series }> a 
k=1 


lee) 
3k4+2 4: 
p> his diverges. 
Geometric series and p-series (see Theorem 7.2.5) offer a collection of companion 
series that are frequently used in applications of Theorem 7.2.2. 
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7.2.2 The Integral Test 


Suppose that a nonnegative decreasing continuous function can be used to determine 
the terms of a series. Then there is a convergence test for the series that requires one 
to evaluate an improper integral of the function. 


Theorem 7.2.3 (Integral Test). Let f: [1,c0o) + R be a decreasing and continuous 
function such that f(x) > 0, for all x in [1,00). Let ay = f(k) for all k € N. Then 


3 a, converges if and only if if f converges. 
k=1 


Proof. Suppose that a, = f(k) for all k, and f: [1,00) + Risa decreasing continuous 
function such that f(a) > 0, for all x in [1,0o). Note that each a, > 0. Since f is 
decreasing, we have that 


angi = f(kK+1) < f(x) < f(k) =ay, for allk e@Nandallze[k,k+1]. (7.2) 


Since k + 1—k = 1, (7.2) and Lemma 6.1.15 imply that 
k+1 
Opa <S f <x, for all k EN. (lad) 
k 
Consider the sequences (ax+1), (e's ), and (a,). Theorem 7.2.1 and (7.3) imply 


that > ay converges if and only if bs ( as f) converges. By Theorem 6.2.11 and 


Definition 6.4.13, we also have that 


90 k+1 n k+1 n+l lve) 
= lim i) = lim i a ) 


k=1 


where the last equality holds because f(x) > 0, for all x in [1,00). Therefore, 3 ar 
k=1 


converges if and only if [7° f converges. 


The following definition identifies a generalization of the harmonic series. 
[oe 
Definition 7.2.4 (p-series). A p-series is any series of the form )> va where p > 0. 
k=1 


In the definition of a p-series, the value p is a fixed positive real number. The 
integral test is used to establish the following p-series test. 


Theorem 7.2.5 (p-Series Test). The p-series >> e converges if p > 1 and diverges 
k=1 


Posy: 


Proof. If p = 1, then PE: pw is the harmonic series, which diverges by Theorem 7.1.10. 
So assume that p Pi 1 and p > 0. Define f: [1,00) — R by f(x) = x”. Then 
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a = f(k) for all k EN, and f is positive, continuous, and decreasing on [1,0o). We 
now evaluate the improper integral 


L 
oP Creo tS ifs: 
i x ? dz = lim e = jim ——— = 2P- B 
1 f>00 | 1—p Fi f>c00 l—-p Oo, ifp< 1. 


Thus, by Theorem 7.2.3, >> & converges if p > 1 and diverges if 0 < p< 1. 
k=1 


7.2.3 Alternating Series 


The tests in the above two sections involve series with nonnegaitve terms. We will 
now discuss series whose terms alternate between positive and negative values. 


Definition 7.2.6 (Alternating Series). An alternating series is a series having one 


of the forms .s (—1)**1a, or 3 (—1)*a,, where a; > 0 for all k. 
k=1 k=1 


For example, Xl 1)R1E = 1-54 5----+(-1)**1£4--- is alternating series. 


Theorem 7.2.7 (Alternating Series Test). Let > (—1)**1a, be a series such that 
k=1 


CO 
ap > Ap41 > 0 for all k, and jim ap = 0. Then S> (—1)**1a, converges to a sum 9 
00 k=1 
such that |S — s,| < a+, for all n, where s,, is the nth partial sum of the series. 


CO 

Proof. Suppose that the series $7 (—1)***a, is such that (A) a, > ag41 > 0 for all k, 

k=1 

and (¥) jim ay = 0. Consider the “even” partial sums 59, 54, 86,...,52an,..- Where, 
—> 00 


for each n € N, 


San = @1 — AQ + G3 — A4 ++ + G2n-1 — Gan} 
82(n+1) = @1 — Ag + 3 — Ag + * + AQn-1 — G2n + Gen+1 — G2(n+1) 
= $2n + A2n+1 — G2(n41)- 


So, for any n CN, Sa(n41) — San = Gan41 — Ga(n4i1) = 0, by (a). Thus, (s2n) is an 
increasing sequence. Note that, for every n € N, 


San = @1 — (@2 — ag) — (a4 — a5) — +++ — (dan—2 — Aan—1) — Gan < a4 (7.4) 


aS dan > 0 and ag — apy, > 0 for all k € N. Hence, the sequence (s2,,) is bounded 
above by a; and thus, Theorem 3.4.6 implies that (s2,) converges. Let lim $9n = 8. 


Now we consider the “odd” partial sums 51, 53, 55,...,S2n—-1,---. Observe that for 
each natural number n, 


San-1 = A, — 2 + A3 — Ag +++ + Aan—13 


$2n = A, — G2 + G3 — G4 + +++ + Aan-1 — Gan 


= $2n-1 — 42n- 
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Thus, 52n-1 = S2n + Gan, for all n € N. By (¥), we have that lim dan = 0. Since 
lim Son = S, Theorem 3.2.2 implies that 
lim sg,-1 = lim sa, + lim aa, = S+0=S. 
noo noo noo 
So, by Exercise 14 on page 79, Jim Sy, = S. Therefore, iG 1) eae = S: 


We will now prove that sie: error estimate |S — ) x Gn+41 holds for all n. By 
Exercise 3, (s2n_1) is a decreasing sequence. Since (s2,) is an increasing sequence, 
(San_1) is a decreasing sequence, and lim so, = S = lim s2,_4, it follows (see 

n—0o noo 
Corollaries 3.4.3 and 3.4.5) that 


San SS < Santi S San-1, 
for each n € N. Thus, for all natural numbers n, we have that 
|S — Son| < |San4i — S2n| and |S — san-i| < |Sen — Soni. 


Therefore, |S — s,| < |Sn41 —S,| for all n € N. Since 5,41 — 8, = Gn41 > 0, we 
conclude that |S — s,| < @j41 for every natural number n. 


Since the alternating series 5° (—1 aie: 1 satisfies the conditions of Theorem 7.2.7, 
k=1 
it converges. We note that since S> (—1)*t"a, = (—1) 4) (—1)*ax,, Theorem 7.2.7 also 
k=1 k=1 


CO 
applies to alternating series of the form > (—1)*ax. 
k=1 


7.2.4 Absolute Convergence 


The partial sums s,, of a series with nonnegative terms forms an increasing sequence 

(Sy). For this reason, such a series is typically easier to investigate than a series with a 
[oe 

varied arrangement of positive and negative terms. Given a series 5> a, with positive 
k=1 


[oe) 
and negative terms, consider the series u |a,| with nonnegative terms. Can we use 


the series » |a,,| to determine whether or not % a, converges? Our next theorem, 
k=1 
addresses this question. 


Theorem 7.2.8. Let s ax, be a series of real numbers. If 5° |az| converges, then 
k=1 k=1 


[oe 
the series 5* a, converges. 
k=1 


Proof. Let n > m be natural numbers. By the triangle inequality, we have that 


n n 
> aR) < a |ax| - 
k=m+1 k=m+1 


Thus, the result follows from Theorem 7.1.12. 
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Definition 7.2.9. Let >> a, be a series of real numbers. 
k=1 


CO [oe) 
1. If 3° |a,| converges, then the series 5> ag is said to be absolutely convergent. 
k=1 k=1 
[o@) 
2. If > dp converges and = |az| diverges, then >> a, is said to be conditionally 
k=1 
eanvercent. 


Moreover, if a series is absolutely convergent, then we will say that it converges 
absolutely. Similarly, if a series is conditionally convergent, then we will say that it 
converges conditionally. 


Clearly, by Theorem ee if a series is absolutely convergent, then the series itself 
converges. So, suppose that = dr is a series with positive and negative terms. Since 


the tests in Sections 7.2.1 cag 7. i 2 all concern series with nonnegative terms, some of 


these tests may determine that 3 |az| converges, and if it does, then Theorem 7.2.8 
k=1 


would imply that 5> a, converges. On the other hand, the converse of Theorem 7.2.8 
k=1 


[o.@) 
does not hold, that is, if a series }> a, converges, we cannot conclude that 3 |ax| 
_ k=1 _ k=1 
converges. For example, >°> (—1)**1 4 converges, but 3 |(-1 yRrts| => i does not 
k=1 k=1 k=1 


CO 
converge. Thus, >> (—1)*t12 is conditionally convergent. 
k=1 


7.2.5 The Ratio and Root Tests 


As mentioned above, one may be able to use some of the tests already introduced to 
determine if a series converges absolutely. But these tests do not work for all series. 
In this section we shall derive two new tests which often prove to be useful in showing 
that a series is absolutely convergent. These tests are called the Ratio Test and the 
Root Test. These tests deal with the terms of a given series by measuring the rate 
at which the terms converge to 0. Of course, we know that if the terms of a series 
converge to 0, this does not imply that the series converges. However, the proofs of 
these new tests show that if this rate of decrease is comparable to that of a convergent 
geometric series, then this will ensure convergence. 


The Ratio Test 


The Ratio Test measures the rate of decline of the terms of a series a1, a2,...,@x,. 
by investigating the long-term behavior of the ratio jose. If this rate is less than: 1, 
then the series will converge. The proof of the test relies on the direct comparison 


test and a geometric series. 


Theorem 7.2.10 (Ratio Test I). Let 5° a, be a series of nonzero real numbers. 
k=1 
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< p for 


(1) If there is real number p < 1 and a natural number N such that re 


all k > N, then > az, converges absolutely. 
k=1 


> 1 for allk > N, then >> ax diverges. 
k=1 


(2) If there is an N € N such that 


Qk+1 
aK 


Proof. Let >> ax be series of nonzero real numbers. To prove (1), let p < 1 and let 
k=1 
N €N be such that |*s++ 


< p for all k > N. It follows that 


lan+i] < plax| for all k > N. (7.5) 
Using (7.5), one can prove by induction on k that 

lansk| < p* |ay| for all k EN. (7.6) 
Since 0 < p < 1, the geometric series » p* \ay| clearly converges by Theorem 7.1.11. 
Thus, (7.6) and Theorem 7.2.1 ee S |an+4%| converges. Since > lan+e| is 
a tail of the series > |axz|, we conclude fini rgSen 7.1.13 that the Lee > |an| 
converges. Therefore, 3 ap converges absolutely. : 


k=1 
To prove (2), let N be a natural number such that 


Qk+1 
ak 


> 1 for alln > N. Thus, 
lan41| > |ax| for all k > N. 


CO 
Since |ay| > 0, the sequence (az) does not converge to 0. Hence, >> ag diverges by 


k=1 
Theorem 7.1.9. 


Theorem 3.8.16 and Theorem 7.2.10 imply the following more familiar appearing 
version of the ratio test (also see Exercise 12.) 


Corollary 7.2.11 (Ratio Test II). Let }° a, be a series of nonzero real numbers. 


k=1 
(1) If lim sup oe <1, then 5° a, converges absolutely. 
k- 00 k=1 
(2) If liminf |“*| > 1, then >> ax diverges. 
k- 00 k k=1 


The Root Test 


The Root Test is similar to the Ratio Test, except that it measures the rate of decrease 
of the terms of a series a1, d2,...,@%,... by examining the long-term behavior of the 
value */Ja;|. Again, if this rate is less than 1, then the series will converge. The proof 
of this test also depends on the direct comparison test and a geometric series. 
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Theorem 7.2.12 (Root Test I). Let 5° a, be an infinite series. 
k=1 


(1) If there is a positive real number p < 1 and a natural number N such that 


¥/|ax| < p for all k > N, then » az converges absolutely. 
=1 


(2) If </|a,| > 1 for infinitely many natural numbers k, then >> a, diverges. 
k=1 


Proof. To prove (1), let p € Rt and N € N be such that Jax] < p < 1 for all 
k > N. It follows, by induction, that 


laniz| < op’ p* for all k EN. (7.7) 
Since 0 < p < 1, the series $7 p% p* converges, by Theorem 7.1.11. Thus, (7.7) and 
k=1 
Theorem 7.2.1 imply that >> |an+%| converges. Since >> |ay+x| is a tail of the series 
k=1 k=1 


> Jax], Theorem 7.1.13 and Theorem 7.2.8 imply that 5° az, converges absolutely. 
k=1 k=1 


To prove (2), assume that ¥/|a,| > 1 for infinitely many natural numbers k. Thus, 
|ax| > 1 for infinitely many k. So the sequence (ax) does not converge to 0. Hence, 
CO 


the series 5> ax diverges by Theorem 7.1.9. 
k=1 


Theorem 3.8.16 and Theorem 7.2.12 imply the following modified version of the 
above root test (also see Exercise 13.) 


Corollary 7.2.13 (Root Test ID). Let 3 a, be an infinite series. 
k=1 


(1) If lim sup (/Jax| <1, then >> az converges absolutely. 
k- 00 


(2) If limsup ¥/|a,z| > 1 or (4/[axl) is not bounded above, then 3 a, diverges. 
k—-00 


k=1 


Theorem 3.8.17 implies that if ratio test II decides either the convergence or the 
divergence of a series, then root test II will arrive at the same conclusion. So root test 
II is considered to be a stronger test. However, ratio test II is often easier to apply. 

Recall that if a sequence (s,) of partial sums diverges, then the corresponding 


lee) 
series }> a, is said to diverge. Moreover, if (s,,) diverges to co (see Definition 3.7.1), 
k=1 


CO [o.@) 
then we will write >> a, = co. We also write S> ax = —on, if (s,) diverges to —oo. 
k=1 k=1 


Exercises 7.2 


*1. Prove Theorem 7.2.2(2). (See Theorem 3.1.24.) 
*2. Prove Theorem 7.2.2(3). 
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*3. 


*D. 


10. 


11. 


*12. 


. Test each series for convergence: (a) S Wee (b) 
k= V* 


Under the hypothesis of Theorem 7.2.7, show that the “odd” sequence (s2n—1) is 
decreasing and bounded below. 


. Let » (—1)*ax be a series such that az, > ag41 > 0 for all k, and jim arp = 0. 
00 


Show that Theorem 7.2.7 implies that PG 1)*a, converges to a sum T such 


that |T — s,| < a@n41 for all n, where s, is the nth partial sum of the series. 
Let 3 ax be a series. For each k €N, let 
k=1 
J ak, if a, > 0; J aR, if a, < 0; 
Ba. ae. Po abe SO: 
(a) Show that |ax| = px — n, and ag = pe +g, for all k EN. 
(b) Prove: 5> a, converges absolutely if and only if 5> pz and >> 1, converge. 
k=1 k=1 k=1 
(c) Suppose that > a, converges conditionally. Prove that >> pz = oo and 
k=1 k=1 


>> Ne = —co (see Exercise 3 on page 195). Then prove that S> pz = oo 
k=1 


=m 


and > Nk = —00, for each m EN. 


. Prove that if > a? and > b? converge, then > axby converges absolutely. 


2 k=1 
[Hint: (a — b)? > 0.] 


[oe) 
. Suppose that >> a, is absolutely convergent and (b;) is a convergent sequence. 


k=1 


[oe) 
Prove that u abe converges absolutely. 


. Using the integral test, show that > sr converges. 


ee 
Xe 
Using a comparison test, test each series for convergence: 
[oe) CO [oe) 
i 1 1 3k+2 
(a) > geez (b) ars (c) OB (d) 2 1 Ve(BK+5)? (e ) > Tate 
Determine whether or not the following alternating series converge. 


G@) SDs b) DVM (6) DD 


Let >> by be an series of nonzero terms. Show that Theorem 7.2.10 implies: 
k=1 


(1) If lim ses >> by converges absolutely. 
(2) atk Jim Bee “i! = oo, then Py bj, diverges. 
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*13. Let >> a, be an infinite series. Show that Theorem 7.2.12 implies the following: 
k=1 


(1) Te lim Vlaz| <1, then >> ay, converges absolutely. 


(2). IE lim V/|az| > 1 or lim ¥/|ax| = 00, then >> ax diverges. 


14. Use an absolute convergence test to show that the following series converge. 


(a) 4 (-1)" ss, (b) > 


15. Test each of these series for convergence: 
CO k CO k CO 
Ce wee ie 
k=1 k=1 k=1 
16. Show that the series 


(a) > (—1)*Z is absolutely convergent 


(b) 33 (—1)**"4 conditionally convergent. 
k=1 


17. Test each of these series for convergence: 
1 el 

a a, (b i. 

(a) & age (b) & 
18. Suppose that the series > az is conditionally convergent where a, 4 0 for all 

k=1 
k € N. Show that 5° ™ diverges, where for each k € N, 7m is defined by 
k=1 


J ak, if a, < 0; 
MOS Gees Bans 


7.3. REGROUPING AND REARRANGING TERMS OF A SERIES 


Given a finite sum of real numbers, say a] + a2 + a3 + 44+ 45, by the associative law 
we see that 

a, + ag + ag + a4 + a5 = a1 + (2 + 3) + (a4 + G5). 
So we can regroup the terms of the finite sum and get the same answer. Clearly, any 
regrouping of the terms in a finite sum will yield the same sum. Moreover, by the 
commutative law, we have that 


a, +a2+03+ 04+ a5 = A4+ AQ + a5 + 43+ Q4. 


Thus, by associativity and commutativity, we can rearrange the terms of a finite sum 
get the same result; that is, the order in which one performs the addition does not 
affect the sum. In the same way that one can change the terms of a finite sum, one 
can alter the terms of an infinite series. Can we reach the same conclusions for an 
infinite series that we did for a finite sum? In this section, we address the following 
question: What happens to the convergence or divergence of an infinite series if the 
terms of the series are regrouped or rearranged? 
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7.3.1 Regrouping 


A regrouping of an infinite series is obtained simply by “inserting non-overlapping 
parentheses” in which each term is enclosed within one pair of a parentheses. For 
example, consider the infinite series 


Co 
So ag = ay + a2 +03 +04 +05 +ag+-°°. 
k=1 


We can construct a new series by regrouping the terms of the series as follows 


co 


do (@2x-1 + aan) = (a1 + a2) + (a3 + a4) + (a5 +46) +2 (7.8) 
k=1 
By using different regroupings, one can construct many other new series as well. 
Suppose that a given series converges. Does a regrouping of the series also converge? 
The answer is yes; but, before proving this fact, we must first present a precise 
definition of a regrouping of a given series. 


Definition 7.3.1. Let 5° a, be a given series. Let w: N - N be strictly increasing; 
k=1 


that is, for all m and nin N, if m < n, then w(m) < y(n). Define a new series Ly by, 
k=1 
as follows: 


by = (a1 +42 +--+ ayy) 
b= (auay41 + Qy(ayg2 Fo + ay(2) 
(7.9) 


bp = (Qug—ayn1 TF Oep(e—1)-+2 T° ay(n)) 


[oe) [oe) 
The series }> by is said to be a regrouping of the series 57 ax. 
k=1 k=1 
The regrouping in (7.8) can be formally defined by using the function w: N > N 
defined by w(k) = 2k. The next theorem shows that if a series converges, then any 
regrouping of the series will converge to the same value. So a convergent infinite series 
satisfies a generalized associative law. 


[ee) 
Theorem 7.3.2. Let >> ag, be a series that converges to the real number a, and 
k=1 


[oe [oe 
let >> by be a regrouping of >> ax using the strictly increasing function w: N > N. 
k=1 k=1 


[oe 
Then >> by converges to a. 
k=1 


n n 
Proof. For each n € N, let sn = S> ax and th = >> by be the respective nth partial 
k=1 k=1 
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sums. Assume that (s,) converges to a. Observe that t, = Syn) for each n € N 
(see (7.9)). Thus, (tp) = (suqn)) and so, (tn) is a subsequence of (s,,). Theorem 3.3.4 


implies that (t,) converges to a. 


In a manner similar to Definition 7.3.1, one can define the regrouping of any 


series having the form >> az (see Exercise 1). Theorem 7.3.2 also holds for such a 
k=0 


regrouping. 


7.3.2 Rearrangements 


Suppose that A = aj + a2 +43 +---+4n, where aj, d2,03,...,@, are real numbers 
and n is a natural number. Thus, A is a finite sum of real numbers. Since addition 
satisfies the commutative law, if one evaluates a rearrangement of the terms in this 
sum, say, d4+a,+a4,+--+:+4a7, then one will obtain, again, the original value A. Now, 


[oe) 
suppose that the infinite series }> a, converges to A. What happens if we rearrange 


the terms in this infinite devien? Will the rearranged series converge to A, as well? 
We will now confront this question. First, we must precisely define what it means to 
rearrange the terms of an infinite series. 

If 7: N > N be one-to-one and onto N, then o is a permutation of N. If o is 
such a permutation, then 


o(1), 0(2),0(3), 0(4), o(5),... 


produces a reordering of all the natural numbers, when a is not the identity function. 
For example, let ¢: N > N be defined by 


(7.10) 


n—1, ifn is even. 


ies 1, if n is odd; 
a(n) = 


The function ¢ is one-to-one and onto N. By evaluating o(1), 0(2), 0(3), 0(4), 0(5),... 
we obtain the following simple reordering of the natural numbers: 


DLA B65 OF ak 


[oe 
Definition 7.3.3. Let >> a, be an infinite series and let 0: N > N be a permutation 
k=1 


[oe) [ee) 
of N. The series 7 a, x) is called a rearrangement of the series >? ag. 
k=1 k=1 


To illustrate this notion, let 7: N > N be the permutation of N given by (7.10). 


[oe) 
Applying o to the indices of >> ag, we obtain the rearrangement 


k=1 
[oe) 
Se nea rere 
k=1 


[oe [oe 
Now, suppose that a series }7 a; converges to A and > a(x) is a rearrangement 
k=1 k=1 
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of > az. Will the rearrangement converge? If so, will it converge to A? Lemma 7.3.4 


ict Colney 7.3.5, below, provide a partial answer. 
The proof of Lemma 7.3.4 takes advantage of the following observation: Whenever 
o is a permutation of N, then Theorem 1.3.9 implies that o~! is also permutation of 
N such that o~!(o(k)) = k for all k € N. Thus, > a, is a rearrangement of > ag,x). 
k=1 k=1 


CO 

Lemma 7.3.4. If the series }> p, converges to A where pz > 0 for all k € N, then 
k=1 

any rearrangement of this series converges to A. 


Proof. Suppose that s pe = A and that (A) py > O for all k € N. Let o bea 
k=1 


[oe [oe 
permutation of N. So, >? po x) is a rearrangement of >) pr. For each n €N, let 
k=1 k=1 


=< So pK and th = S— Pack): 
k=1 k=1 


Since p, > 0 for all k € N, the sequences (s,,) and (t,) are increasing. As DD pe = A, 


Corollary 3.4.3 implies that s, < A for all n € N. We will now show that: _ a A for 
alln € N. Let n EN. Since {o(1), 0(2),...,0(n)} is a finite set of natural numbers, 
there is a natural number N such that o(k) < N for all k < n. So for each k < n, 
there is an i < N such that pox) = p;. Hence, by (a), 


n N 
Lg Ds Pel SS, Deon = As 
k=1 k=l 


Thus, t, < A for all n EN. Lemma 3.4.2 implies that there is a B € R such that 
lim t, = B and B < A, that is, > Pott) = B<A. 


noo 


As 3 Po(k) = B and pS pr is a rearrangement of 2 Po(k), the above argument 
k=1 =1 


implies that 3 pe = A< B. Hence, A = B. Therefore, x Po(k) = A. 
k=1 k=1 
Corollary 7.3.5. If >, gy converges to B where gq, < 0 for all k € N, then any 
rearrangement of thie pares converges to B. 
Proof. Let >> q, = B and q, < 0 for all k € N, and let o be a permutation of N. 
k=1 


Theorem 7.1.7 implies that S (—q~) = —B where —q, > 0 for all k € N. Thus, 
k=1 


: —Jo(k) = —B by Lemma 7.3.4. Hence, again by Theorem 7.1.7, 3 Go(k) = B. 
k=1 k=1 
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So, if a series has only positive terms, or only negative terms, and it converges 
to a value, then any rearrangement of this series will converge to this same value. 
Suppose that a series has an “infinite” mix of positive and negative terms. Will the 
same hold for such a series? Yes, if the series is absolutely convergent. 


Theorem 7.3.6. If = ay is an absolutely convergent series that converges to A, 


then any réariandeiient of this series converges to A. 


[o,e) 
Proof. Assume that 5° az is an absolutely convergent and converges to A, that is, 
k=1 


(A) 3 ay, = A. For each k €N, let 


k=] 
_ Jax, if ay, > 0; _ Jax, if ay <0; 
PNG. Sega ~ MET NG. rapiaasg. 


Clearly, pp > 0 and n, < 0, for all k € N. Note that (¥) ay = pp + for all k EN. 
Since >> ax is absolutely convergent, Exercise 5 on page 204 implies that (») both 


co 


> pr and >> n, converge. Let o be a permutation of N. We now have the following: 
k=1 k=1 


A= Joan = (e+) by (a) and (¥) 
k=1 k=1 
= +e PR+ Dy "k by (») and Theorem 7.1.7 
k=1 k=1 


[e,e) [e.e) 

= yp Po(k) + SS No(k) by Lemma 7.3.4 & Corollary 7.3.5 
k=1 k=1 
[ee) 


=> (pew + No(k)) by Theorem 7.1.7 
k=1 


= S- asx) by (¥). 
k=1 


We next prove a theorem which shows that the conclusion of Theorem 7.3.6 fails 
for any conditionally convergent series, a surprising result. The theorem states that 
one can rearrange a conditionally convergent series to one that converges to any 
real number of one’s choosing. The key idea behind the proof of this theorem is not 
complicated; however, the formal proof is a bit technical, in part, because one has to 
inductively define three sequences before one can identify the relevant rearrangement 
of the given conditionally convergent series. 

Before presenting the proof, we offer the following summary of the proof: Since the 
series is conditionally convergent, it must have an infinite number of positive terms 
and an infinite number of negative terms (see Exercise 5(c) on page 204). Let L > 0. 
First we choose successive positive terms from the conditionally convergent series 
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until their sum is greater than L. Then we choose successive negative terms, from the 
series, until the sum of all the chosen positive and negative terms is less than L. We 
then inductively repeat this process by adding, in the same manner, more positive 
and negative terms to the preceding sum. During this process, we also record how 
this rearrangement is being constructed. In the end, we will have a rearrangement of 
the series that converges to L. 


Theorem 7.3.7. If Ss dy, is a conditionally convergent series, then for any real 


number J, there is a ronne SuBeiNett of this series that converges to L. 


Proof. Let s ay, be a conditionally convergent series. Without loss of generality, we 
k=1 
will assume that a, 4 0 for allk € N. Let 1, p2, 93,... be all the positive terms in the 


CO 
series )> az, ordered as they occur in the series; and let 71, 72, 13,... be all the nega- 
k=1 


[oe [oe 

tive terms in py Gx, also ordered as in the series. Since )> a, converges, Theorem 7.1.8 
k=1 

implies that he sequences bP) and (7x) converge to 0. Moreover, Exercise 5(c) on 


page on 204 implies that = Pr = co and Boi Nk = —00, for each m EN. 


Let L > 0. (The aber fer L<0Ois ivenc 10.) We inductively define sequences 
(ix), (nx), and (Sp,). These sequences will allow us to identify a rearrangement of 
>> ag and a sequence of partial sums of this rearrangement. At the odd stages of the 
k=1 


[o.e) 

induction we put some of the positive terms of 5> a, into the rearrangement, and 
k=1 

at the even stages we place a some of the negative terms, of the original series, into 


[oe 
the rearrangement. Let po = 0 and 7 = 0. Since the partial sums of 5° pz are not 


bounded above and L > 0, there is an 7, € N such that 

pit pat pater + px-1 SL < pit pat pate + Pix-1 + Pir; 
and let nj = 4 and sp, = p1 + po +--+ p;,. Thus, LD < sp, and |sp, — L| < |pi,| 
(Corollary 2.2.10(1)). Since the partial sums of > 7, are not bounded below and 


k=1 
L < 8»,, there is an tg € N such that 


Sn, HM +2413 + + Nir + Mig < LS Sn, +m + M2 +3 +++ + Nig-1- 


Let ng = rn, + ig and Sp, = Sn, tM +72 +-°°+Ni,- SO Sn, < Land |s,, — L| < |n;,| 
(Corollary 2.2.10(2)). 

Let k > 2. Assume that 7,1, Ng, Sn, are defined and that s,, < L if k is even, 
and Sp, > L if k is odd. We shall now define ix41, Nk+1, Sn,,,. There are two cases 
to consider. 

CASE 1: k + 1 is odd. Since the partial sums of >> pr are not bounded above 
k=(ip-1+1) 
and s,, < L, there is a natural number i441 > %,—1 such that 


Snj, + P(in—1+1) Hg eS P(insi—1) < Le< Snz + P(in—1+1) ea P(ingi—1) + Pixgar 


Infinite Series M 211 


and let ngy1 = me + ipy1 and 


Snyyy = Sng 1 P(ix—1 +1) speeesE Plinsi—1) oF Pinar’ 


So L < 8p,,, and |s,,,, — L] < |pi,,,]. 
CASE 2: k + 1 is even. Since the partial sums of | >> 7 are not bounded below 


k=(tn-14+1) 
and L < s,,, there is a natural number 7,41 > %,—1 such that 


Sn, + Nin —1+1) eal NGn41-1) T Ning < Les Sn_ + N(in—1+1) tee NMipsi-1)> 
and let nei = Ne + ixy1 and 
Snpii = Snp ST ge eR ee Mingi-l + Mya: 


Hence, Sn,,, < Land |sn,,, — L| < |na4,|- 
The inductive definition of the sequences (i), (ng), and (Sp,) is now complete. 
As a result, for all k € N, we have that 
|Sn, — L| < |p:,| if k is odd; 


7.11 
|Sn, — L| < |m,| if & is even. ee 


The constructed series is 


PE Dig Pa Se eg Peel eee Dia agen ee og oe ieig, 472) 


which is a rearrangement of 3 dp, denoted by x g(r). Let (Sn) be the sequence 
k=1 


of partial sums of the series (7. 12), and let (s,,) be the subsequence of (s,,) that 
was defined in the above inductive definition. For example, sn, = p1 +--+: + pi, and 
Sng = Pr tre t+ pie FM +++ + Mig- 

Since the sequences (p,) and (7) converge to 0, the subsequence (s,,,) converges 
to L by (7.11). From the definition of (s,,) and the terms in (7.12), it follows that for 
all: ke N and-n.6 Nil tip SS Hee ty WHEN Sa SSeS Shae OF Sapag SS Sais Sage 


Lemma 3.3.7 implies that (s,) converges to L, that is, )> ag(n) = L 
k=1 


Thus, given any specific real number, the terms of a conditionally convergent series 
can be rearranged to obtain a series that converges to this given number. Hence, the 
sum of a conditionally convergent series depends on the arrangement of its terms. 
Whereas, a rearrangement of the terms in a absolutely convergent series, or in a finite 
sum, will have no effect on the resulting sum. Thus, the properties of addition that 
hold for finite sums do not always hold for infinite series. 


Exercises 7.3 


*1. Let N* = {0,1,2,3,4,...}. Define a strictly increasing pon w: N* > N* that 


formally verifies that > (d2~ + G2¢41) is a regrouping of » Ak. 
=0 
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*10. 


. Find a regrouping of s (— 


. Show that >> (47*r2* 4 22k+1,2k+1) is a regrouping of >> (3 + (—1)*)*r*, using 
k=0 


k=0 
Exercise 1. Let |r| < }. Given that )> (3 + (—1)*)*r* converges, find its sum. 
k=0 


1)**1 that converges to 0. Now find another regrouping 


of this series that Sanverses to 1. 


. Let s az be a series of positive terms. Let W: N > N be strictly increasing and 
k=1 


let >> by be the resulting regrouping of S> ax, as in Definition 7.3.1. Suppose 
k=1 


CO [o,e) 
that » by converges to a. Using Lemma 3.3.7, show that 5> a, converges to a. 
k=1 k=1 


. Suppose that > dr is absolutely convergent, and let > g(x) be a rearrangement 


k=1 
3 
of 2 az. Prove that . |ax| = 
k=1 k=1 k= 


: Y Jara]. 


lee) lee) 
. Let S> a, be a series and let o be a permutation of N. Prove that > a,x is 


k=1 k=1 


[oe) 
absolutely convergent if and only if >? a,(4) is absolutely convergent. 
k=1 


[oe 
. Let o be a permutation of N. Evaluate the sum of ue saiy 


. Let 3 az be divergent with nonnegative terms. Show that > Ag(z) also diverges 


k=1 
for any rearrangement. 


[oe) 
. Let >> ax be a conditionally convergent series. 


k=1 

(a) Show that there is a rearrangement of 5 ax such that the sequence of partial 
sums of the rearranged series is not Bonndéa above. 

(b) Show that there is a rearrangement of o ax such that the sequence of partial 


k=1 
sums of the rearranged series is bounded but does not converge. 


[oe 
Let S> by be a conditionally convergent series. 
k=1 


(a) Show that 3 (—bx) is conditionally convergent. 
k=1 


(b) Assume that Theorem 7.3.7 holds whenever L > 0. Show that this implies 
that for any J < 0, there is a permutation o of N such that >? bo(4) = J. 
k=1 


CHAPTER 8 


Sequences and Series of 
Functions 


In this chapter, we will investigate sequences and series of functions. We have already 
investigated sequences and series of real numbers. Each real number number that 
appears in a sequence is viewed as a distinct object. A function f is also a distinct 
mathematical object. Recall that f(a) denotes only the value of f at x. So f(x) is 
a real number, whereas the object f is not a real number; it is an operation that 
produces a real number value f(2) for each x in the domain of f. 


8.1 POINTWISE AND UNIFORM CONVERGENCE 


8.1.1 Sequences of Functions 


In Definition 3.0.1, we defined a sequence to be a function s: N > R. We will now 
extend the notion of a sequence to be any function whose domain is the set of natural 
numbers. So to obtain a sequence of functions, we need to associate a function f, to 
each natural number n. We will then let (f,) denote this sequence of functions. 
For example, for each n € N, let f,: R — R be defined by f,(x) = nx”. Then (f,) 
is a sequence of functions. In most cases, the functions in a sequence will have the 
same domain. 

Let (fn) be a sequence of functions each of which has domain D. For each x € D, 
we have that (fn(x)) is a sequence of real numbers, and it is not not a sequence of 
functions. Moreover, the sequence (f;(x)) of real numbers may or may not converge. 
However, when (f;,(x)) converges for all « € D, we can define a new function. 


Definition 8.1.1. Let (f,) be a sequence of functions that are defined on D C R. 
Then (f,) converges pointwise on D if for each x € D, the sequence (f,(x)) 
converges. Whenever (f,,) converges pointwise on D, we can define the limit function 
f: D> R by 

f(a) = lim fala) 


for each x € D, and we say that (f,) converges pointwise to f on D. 
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Example 8.1.2. Let f,: [0,1] — R be defined by f,(x) = x”, for each n € N. Then 
for each x € [0,1], recalling Corollary 3.1.12, we have that lim fn(z) = f(x) where 


0, if0<a2<1; 
i rt 
fe) . ife=1. 


? 


Observe that the limit function f is not continuous while all of the functions in the 
sequence (f,,) are continuous. In addition, each f,, is differentiable on [0,1], but f is 
not differentiable at 1. 


As Example 8.1.2 demonstrates, there are sequences of continuous functions whose 
pointwise limit is not continuous. There are also sequences of bounded functions 
whose pointwise limit is not bounded. We now define a stronger notion of convergence 
which does not have these particular shortcomings. 


Definition 8.1.3. Let (f,) be a sequence of functions that are defined on D C R, 
and let f: D — R. Then (f,) converges uniformly to f on D if for every ¢ > 0, 
there is an N € N such that for alln €N, ifn > N, then |f,(x) — f(x)| < e for all 
x € D. A sequence (fp) is said to converge uniformly on D if there is a function f to 
which (f,,) converges uniformly on D. 


There is a key difference between pointwise convergence and uniform convergence. 
For uniform convergence, given any ¢ > 0, there isan N € Nso that whenever n > N, 
we have that |f,(a) — f(x)| < ¢, not just for a specific point in D, but for all « € D; 
that is, the entire graph of f, must lie within the graphs of f — ¢ and f +6 (see 
Figure 8.1). 


Figure 8.1: If n > N, then |fn(x) — f(x)| < ¢ for all z € D. 


Remark. If (f,,) converges uniformly to f on D, then 


1. (fn) converges pointwise to f on D, and 


2. (fn) converges uniformly to f on any subset of D. 


In logical terms, (f,) converges uniformly to f on D means that 


(Ve > 0)(AN EN)(Vn € N)(Ve € D)(n > N > |fa(x) — f(x)| <6), 


and it is this logical form that motivates the following proof strategy. 
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Proof Strategy 8.1.4. To prove that (f,) converges uniformly to f on D, use the 
proof diagram: 
Let ¢ > 0 be arbitrary. 
Let N = (the value in N you found). 
Let n > N be a natural number. 
Let x € D be arbitrary. 
Prove |fn(x) — f(x)| <e. 


In a proof we may be assuming that a given sequence of functions converges 
uniformly. Our next strategy will then be useful. 


Assumption Strategy 8.1.5. When assuming that (f,,) converges uniformly to f 
on D, then for any ¢ > 0, there is an N € N such that |f,(x) — f(x)| < ¢, for all 
n>WN andallxe D. 


In the proof of our next theorem, we shall apply both of the above strategies. 


Theorem 8.1.6. Suppose that (f,,) converges uniformly to f on D. Let c € R be 
nonzero. Then (cf,) converges uniformly to cf on the set D. 


Proof. Assume that (f,) converges uniformly to f on D and c £0. Let ¢ > 0. Thus, 
there is an N € N such that for alln > N and all x € D, 


Lin(x) — F(2)| < ia (8.1) 


We will now prove that |cf,(a) — cf(x)| < ¢ for alln > N and all x € D. To do this, 
let n > N and let x € D. Hence, 

Icfn(x) — cf(x)| = lel |fn(x) — f(x)| by property of abs. value 
thaw by (8.1). 


Ic| 


Therefore, the sequence (cf,,) converges uniformly to cf. 


<|e| 


The above proof is fairly straightforward. However, to prove that a sequence of 
functions converges uniformly to a specific function using Definition 8.1.3 is typically 
difficult. Fortunately, our next theorem offers an alternative method. 


Theorem 8.1.7. Let (f,) be a sequence of functions that converges pointwise to f 
on D. For each n EN, let 8, = sup{|fn(z) — f(x)| : 2 € D}. Then (f,) converges 
uniformly to f on D if and only if lim Bn = 0. 


Proof. See Exercise 3. 


Theorem 8.1.7 can also be used to show that a sequence of functions does not 
converge uniformly to a given function. This is illustrated in our next example. 
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Example 8.1.8. For each n € N define f,: (0,1) > R by f, (a) = x”. The sequence 
of functions (f,) converges pointwise to the zero function on (0,1). Observe that 


Bn = sup{|z" — 0| : 2 € (0,1)} =1 


for each n € N. Since lim Bn = 14 0, Theorem 8.1.7 implies that (f,,) does not 


converge uniformly to the zero function on (0, 1). 


So by Example 8.1.8, pointwise convergence does not imply uniform convergence. 
We now extend the notion of a Cauchy sequence of real numbers to a sequence of 
functions. 


Definition 8.1.9. Let (f;,) be a sequence of functions that are defined on D C R. 
Then (f,) is uniformly Cauchy on D if for every ¢ > 0, there is an N € N such 
that for all m,n € N, ifn > N and m > N, then |f,(x) — fm(x)| < e for all x € D. 


One can use our next theorem to prove that a sequence of functions converges 
uniformly without knowing the purported limit function. 


Theorem 8.1.10 (Cauchy Criterion). Let (f;,) be a sequence of functions that are 
defined on D C R. Then (f,,) converges uniformly on D if and only if (f,) is uniformly 
Cauchy on D. 


Proof. See Exercises 11 and 12. 


8.1.2 Series of Functions 


Recall that an infinite series of real numbers is viewed as a sequence of its partial 
sums, where each partial sum is a real number. Similarly, an infinite series of functions 
is viewed as a sequence of its partial sums, where each partial sum is a function. Let 
(fn) be a sequence of functions. For each n € N, the nth partial sum function 


n 
Sn: D — R is defined by s,(xz) = >> f(x) for all 2 € D. The infinite series of 
k=1 
[oe 
functions >> f,, denotes the sequence (s,,) of partial sum functions. 
nS1 
Definition 8.1.11. Let t fn) be a sequence of functions defined on D C R and let 
f: D—>R. The series > fn converges pointwise to f on D if the corresponding 
sequence of partial sum F fineuions (Sn) converges pointwise to f on D. Moreover, 


3 f, converges uniformly to f on D if (s,) converges uniformly to f on D. 

We now present an important condition which will ensure the uniform convergence 
of a series of functions. The proof of this result applies Theorem 8.1.10, the Cauchy 
Criterion. 


Theorem 8.1.12 (Weierstrass M-test). Let (fn) be a sequence of functions defined 
on DCR. Let (M,) be @ sequence of real numbers such that |fn(x)| < Mn for all 


x€DandallneN. If > M,, converges, then = fn converges uniformly on D. 


n=1 
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Proof. Let (M,,) be a sequence of real numbers such that 5*> M,, converges and 


n=1 
|fn(a)| < Mn, for allz € Dandalln EN. (8.2) 
Let (s,) be the sequence of partial sum functions for the series 3) fn. We prove that 
3 fn is uniformly Cauchy on D. Let ¢ > 0. Since 3 Mn Sere Theorem 7.1.12 
apie that there is a natural number N such thee 
2 M, <e, foralln>m>N. (8.3) 


k=m+1 


Let n >m > N and « € D. We shall prove that |s,;,(z) — 5,(x)| < € as follows: 


n 

S- fe(x)| by algebra 

k=m+1 
n 


S- |fx(x)| by the triangle inequality 
k=m+1 


[Sn(#) — Sm(x)| = 


IA 


< S°> My <e_ by (8.2) and (8.3). 


k=m+1 


Thus, by Theorem 8.1.10 and Definition 8.1.11, 5° f, converges uniformly on D. 
n=1 


Example. For each n €N, let f,: R — R be defined by f,(x) = ieee Observe that 


|fn(x)| < 4 for all z € R and all n EN. Since 3 + converges by Theorem 7.2.5, 


ns ne 


[oe 
the Weierstrass M-test implies that series 5> f, converges uniformly on R. 
n=1 


Exercises 8.1 


“1. For each n EN, let fr(x) = smi) for all  € R. 


(a) Show that (f,) converges uniformly to the zero function on R. 
(b) Show that (f/) does not converge pointwise on R to the function f’. 


2. For each n EN, let f,(2) = — for all x > 0. Let f(x) = 0 for all x > 0. 


xr+n 
(a) Show that (f,) converges pointwise to f on [0, 00). 
(b) Let 6 > 0. Show that (f,) converges uniformly to f on [0, }]. 
(c) Show that (f;,) does not converge uniformly to f on [0, 00). 


*3. Prove Theorem 8.1.7 


4. For each n EN, let f,: D — R be increasing on D. Show that if (f,,) converges 
pointwise to f, then f is increasing on D. [See Theorem 3.2.13.] 
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5. 


*9. 


10. 


"11. 


*12. 


13. 


Suppose that (f,) and (g,) converge uniformly, respectively, to f and g on D. 
Prove that (fn + gn) converges uniformly to f +g on D. 


. Suppose that (f,) converge uniformly to f on D, and that g: D > R is bounded. 


Prove that (gfn) converges uniformly to gf on D. 


. Let (fn) be a sequence of functions that converges uniformly to f on D. Suppose 


that there is an m > 0 such that each |f(x)| > m and |f,(x)| > m for all « € D 


and all n € N. Prove that (}) converges uniformly to ; on D. 


. Let (fn) be a sequence of functions that converges uniformly to f on D. Suppose 


that f is bounded on D. Prove that there is an N € N such that f, is bounded 
on D, for each n > N. 


Let (fn) be a sequence of functions that converges uniformly to f on D. Suppose 
that each f, is bounded on D. 
(a) Prove that f is bounded on D. 


(b) Prove that (f,) is uniformly bounded on D, that is, prove that there exists 
areal number WM such that |f,(x)| < M for all x € D and alln EN. 


(c) Let M be as in (b). Show that |f(x)| < M for all « € D. [Apply Corol- 
lary 3.2.16.] 


Suppose (f,) and (gn) are sequences of bounded functions with domain D that 
converge uniformly on D to f and g, respectively. 


(a) Show that fn(2)gn(x)— f(x)9(@) = fn(%)(9n(x) — 9(@)) + 9(@)(fr(x) — f(2))- 
(b) Using Exercise 9, prove that (fngn) converges uniformly to fg on D. 


Suppose that (f,,) converges uniformly to f on D. Prove that (f,) is uniformly 
Cauchy. [See the proof of Lemma 3.6.3.] 


Let (fn) be a sequence of functions on D that is uniformly Cauchy. 


(a) Prove that there is a function f: D — R such that (f,,) converges pointwise 
to f. [Apply Theorem 3.6.5.| 


(b) Let ¢ > 0. Since (fn) is uniformly Cauchy, there exists an N € N such 
that for all z € D, ifn > N and m > N, then |f,(x) — fm(x)| < §. 
Let x € D. Since lim fm(x) = f(x), use Corollary 3.2.16 to show that 
|fn(x) — f(x)| < § < € whenever n > N. 


For each n € N, let fp(x) = (sin(x))?” for all x € [0, $3]. Prove that the series 


~ fn converges uniformly on (0, 3). 


n=1 


8.2 PRESERVATION THEOREMS 


As shown in the previous section, pointwise convergence does not ensure that the limit 
function will possess the same properties as the functions in a sequence. Suppose that 
a sequence of functions converges uniformly to f. What properties that are shared 
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by all of the functions in the sequence will be preserved and thus hold for f? In this 
section, we will pursue this question. 

We will first show that uniform convergence is sufficient to ensure that the limit 
function will be continuous if all the functions in the sequence are continuous. This is 
established, in the proof of our next theorem, by applying Strategy 4.1.2 on page 102. 


Theorem 8.2.1. Let (f,) be a sequence such that f,: D — R is continuous on D, 
for alln EN. If (f,) converges uniformly to f on D, then f is continuous on D. 


Proof. Let (f,) be a sequence such that f,,: D — R is continuous on D, for alln € N. 
Assume that (f,,) converges uniformly to f on D. To prove that f is continuous on 
D, let c € D be arbitrary and let ¢ > 0. Since (f,) converges uniformly to f, there is 
an N such that 


lfn(z) — f(x)| < = for allx € Dandalln > N. (8.4) 


So, let n > N. Since f, is continuous and c € D, there is a 6 > 0 such that for all 
x € D, we have 


Ifa(z) — falo)| < 5 when |a — ¢| < 6. (8.5) 
Now, let « € D be such that |x — c| < 6. We prove that | f(x) — f(c)| < € as follows: 
lf(@) — Fle)| = [f(@) — Fl) + fn() — fn(e) + frn(Q — fro) by algebra 


= |(F(2) — fr(@)) + inl) — Fl) + Fal) — fr(Q)| by algebra 
S| f() — fr(®)| + lfn(o) — FC) + |fn() — fr(o)| by triangle ineq, 


<5 +5 4+fale) — frlo)l by (8.4) 
& E E 
oor Ae i ae by (8.5). 


Thus, | f(x) — f(c)| < , if |2 —c| < 6. Therefore, f is continuous on D. 


Since a series of functions is a sequence of partial sum functions, Theorem 8.2.1 
applies to a series of functions as well. Consequently, Theorems 4.1.6 and 8.2.1 imply 
the following corollary. 


Corollary 8.2.2. Let (f,,) be such that f,,: D — R is continuous on D, for alln € N. 


If 3 fn converges uniformly to f on D, then f is continuous on D. 
n=1 

Example 8.1.2 presents a pointwise convergent sequence of continuous functions 
that converges to a discontinuous function. Theorem 8.2.1 thus implies that this 
convergence is not uniform. Furthermore, Example 8.1.8 identifies a sequence of con- 
tinuous functions that does converge pointwise to a continuous function and yet, this 
convergence is not uniform. So the converse of Theorem 8.2.1 does not hold. Under 
certain additional conditions, however, a converse does hold (see Theorem 8.2.3). 

Let (fn) be a sequence of functions defined on a closed interval [a,b]. Then (f;) 
is said to be an increasing sequence of functions if f,(v) < fr41(x) for alln E N 
and all x in [a, }]. 
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Theorem 8.2.3 (Dini’s Theorem). Let (f,,) be an increasing sequence of continuous 
functions, where f,,: [a,b] > R for alln EN. If (f,) converges pointwise to f on [a, 5] 
and f is continuous, then (f,) converges uniformly to f on [a, )]. 


Proof. See Exercises 11 and 12. 


Suppose that each f,, is Riemann integrable on [a,b], and (f,) converges pointwise 
to f on [a, 6]. In addition, suppose that f is Riemann integrable on [a,b]. Can one 
conclude that 


Exercise 10 shows that the answer, in general, is “no.” On the other hand, if (f,) 
converges uniformly to f on [a, 6], then the answer is “yes.” Our next theorem shows 
that uniform convergence implies that the limit of Riemann integrable functions is 
Riemann integrable and, in addition, it implies that the “limit of the integrals is the 
integral of the limit.” 


Theorem 8.2.4. Let (f,) be a sequence where f,: [a,b] + R and f, is Riemann 
integrable on [a,b], for all n € N. If (f,) converges uniformly to f on [a,b], then f is 
Riemann integrable on [a, b] and 


b b 
Jim, [u= [ot 
Proof. Let (fn) be a sequence of functions where f,,: [a,b] + Rand f,, is integrable for 
each n € N. Thus, each f,, is bounded. Suppose that (f,) converges uniformly to f on 


(a, b]. Exercise 9(a) on page 218 implies that f is bounded. We apply Theorem 6.1.18 


to prove that f is Riemann integrable. Let « > 0. Now let «* = 5(bsa)" Since (fn) 


converges uniformly to f on [a, }], there is an n € N such that 
|fn(z) — f(ax)| < e* for all x € [a, 8. 


Thus, 
f(x) — fn(x) < e* and fr(x) — f(x) < e*, for all x € [a, BJ. 


Therefore, 
(a) f(z) < fa(z) +e", and (m) fr(x) < f(x) +e", for all x € [a, B. 


Since f, is Riemann integrable, there is a partition P of [a,b] such that 


€ 
by Theorem 6.1.18. Inequality (4) and Exercises 3(a), 4(a) on page 161 imply that 
OF, P) SU Gul) $e (ba). (8.7) 


In addition, inequality (») and Exercises 3(b), 4(b) on page 161 imply that 


L(fn, P) < O(f, P) + e*(b— a) and thus, (8.8) 
—L(f, P) < —L( fn, P) + e*(b— a). (8.9) 
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Adding inequalities (8.7) and (8.9), we obtain from (8.6) the following result 


U(f, P) — Lf, P) < U(fas-P) — fn, P) +22" (b—@) < 5 +2e"(b—a) = He =e. 


So U(f, P) — L(f, P) < ¢ and Theorem 6.1.18 implies that f is Riemann integrable. 
We now a that Jim i i ce. Let ¢ > 0 be given. Let e* be such that 


(W0<eF< = Since ( f,) converges uniformly to f on [a,b], there is an N € N 
such that for ab n > N, 


|fn(x) — f(a)| < &* for all x € [a, b]. (8.10) 


We conclude that 


b b b 
fn- / = (fn — r| by Corollary 6.2.6 
a 
b 
< / lfn — Fl by Corollary 6.2.17 
a 
< i. e* by (8.10) and Corollary 6.2.8 
a 


=e«*-(b—a) <e_ by Exercise 2 on page 161, and (¥). 


Hence, 


— fis < ¢ for all n > N. Therefore, Jim fo fa = Sof. 


Lemma 6.2.4 and Theorem 8.2.4 imply the following corollary. 


Corollary 8.2.5. Let (f,) bea sequence such that f,: [a,b] > Rand f, is Riemann 
integrable, for each n € N. If 2 fn converges uniformly to f on [a,b], then f is 


>) 


Exercise 1 on page 217 identifies a sequence (f,,) of differentiable functions that 


Riemann integrable and 
1 a 


uniformly converges to a differentiable function f; however, as stated in the exercise, 
the “derivative sequence” (f/) does not converge pointwise to f’. One way to avoid 
such results, is to deal with derivative sequences that converge uniformly and consist 
of continuous functions. 


Theorem 8.2.6. Let (f;,) be a sequence such that f,: [a,b] > R for all n € N. 
Suppose for each n € N we have that 


(1) fn is differentiable on [a,b], and 
(2) f/, is continuous on [a, 0]. 


If (fn) converges pointwise to f on [a,b] and (f/) converges uniformly on [a, 6], then 
f is differentiable on [a,b] and (f/,) converges uniformly to f’. 
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Proof. Let (fn) be a sequence such that f,: [a,b] + R for each n € N. Suppose that 
the above (1) and (2) hold for every n € N. Assume that (f,,) converges pointwise 
to f on [a,b] and that (f/) converges uniformly to g: [a,b] > R. As f/, is continuous 
for all n € N, Theorem 6.3.1 implies that each f/ is Riemann integrable on [a, x] for 
each x € [a,b]. Thus, Theorem 8.2.4 implies that g is Riemann integrable on |a, z] 
for every x € [a,b]. Moreover, Theorem 8.2.4 implies that 


[o- jim, ie (8.11) 


for all x € [a,b]. Because (f,,) converges pointwise to f, equation (8.11) and Theo- 
rem 6.4.2 (Fundamental Theorem of Calculus I) imply that 


[9 = Bm. Unl@) = fal) = F@) - F@ 


for all x € [a,b]. So, 


a= ([s) + f(a), for each x € [a, b). 


Theorem 8.2.1 implies that g is continuous on [a,b]. Theorem 6.4.4 (Fundamental 
Theorem of Calculus II) thus implies that f is differentiable and that f’(a) = g(x) 
for all  € [a,b], as f(a) is a constant. Since (f/) converges uniformly to g and f’ = g, 
we conclude that (f/) converges uniformly to f’, as required. 


Corollary 8.2.7. Let (f,) be a sequence such that f,: [a,b] — R for all n € N. 
Suppose for each n € N we have that 


(1) fn is differentiable on [a, b], and 
(2) f/ is continuous on [a, 6]. 


CO [o.e) 
If 5° fp converges pointwise to f on [a,b] and >> f/ converges uniformly to f’ on 
tL n=1 


[a, 6], then f is differentiable on [a, 6], and for all x € [a, 6], f’(x) = s JE): 
n=1 


Exercises 8.2 


1. For each n EN, define f,: [0,1] > R by fr(x) = sin(ne) +h 
(a) Show that (f,) converges pointwise to f, the zero function on [0, 1]. 
(b) Using Theorem 8.1.7, show that (f,) converges uniformly to f on [0, 1]. 


) 
(c) Does (f/) converges uniformly to fr on [0,1]? 
(d) Using Theorem 8.1.12, show that >» fn converges uniformly on [0, 1]. 
(e) Is 3 fn continuous on [0, 1}? 
n=1 


(f) Does 3° f/, converge uniformly on [0, 1]? 


n=1 
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. Prove Corollary 8.2.5. 


3. Prove Corollary 8.2.2. 


4. Let f,: D — R be uniformly continuous on D, for all n € N. Prove that if (fn) 


*10. 


*11. 


*12. 


converges uniformly to f on D, then f is uniformly continuous on D. 


. Let (fn) be a sequence of functions that are continuous on [a,b]. For each n € N, 


define F,,: [a,b] > R by F,(2) = J” f. Suppose that (F;,) converges uniformly 
on [a,b]. Prove that (f,) converges uniformly on {a, }]. 


. For each n EN, define f,: [0,1] > R by fr(xz) = a +1. 


(a) Show that (f,) converges pointwise to f, the constant 1 function on [0, 1]. 
(b) Using Theorem 8.1.7, show that (f,,) converges uniformly to f on [0, 1]. 
(c) Does (f/) converges uniformly to f’ on [0,1]? 


. Let (fn) be such that f,: D — R is continuous on D, for all n € N. Suppose that 


(fn) converges uniformly to f on D. Let x € D and let (x%,) be a sequence of 
points in D. Prove that if lim Ln = xz, then lim fa(tn) = f(x). [Theorem 8.2.1 


implies that f is continuous and thus, lim Ff tn = Fes] 


. Prove Corollary 8.2.7. 


. Let (fn) be a sequence of functions that are Riemann integrable on [a, b]. Suppose 


that (f,) converges uniformly to f on [a,b]. For each n € N, define F;,: [a,b] > R 
by F,,(x) = J” f. Prove that (F;,) converges uniformly on [a, )]. 


For each n € N define fr: [0.1] > R by 


eel 
0, otherwise, 


fa(z) = fe 1), f 1 <2<, 


for all x € [0,1]. Show that (f,) converges pointwise on [0,1] to f, the zero 
function on [0,1]. Each of the functions f, is Riemann integrable on [0,1], and 
so is the function f. Show that lim 4 fn F ii f. 


Let (fn) be an increasing sequence where f,,: [a,b] + R for all n € N. Suppose 
that (fn) converges pointwise to f on [a, }). 


(a) Show that f,(x) < f(x) for all n € N and all z € [a, d). 

(b) Let n € N. Show that f(x) — fr(x) > f(x) — fr(x) for all x € [a,b] and all 
k>n. 

(c) Prove that (f,) converges uniformly to f on [a,b] if and only if for every 
€ > 0, there is an n € N such that f(x) — fr(x) < e for all x € [a, BJ. 


Let (fn) be an increasing sequence where f,,: [a,b] + R is continuous on |a, db], 
for all n € N. Suppose that (f,,) converges pointwise to f, where f is continuous 
on [a,b]. Prove Theorem 8.2.3 as follows: Suppose, for a contradiction, that (f,) 
does not converge uniformly to f on [a,b]. Now, justify the following steps: 
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(a) There is an ¢ > 0 so that for every n € N, there is an x € [a,b] such that 
f(x) — fn(x) = € (see Exercise 11(c)). 

(b) There is a sequence (x,,) such that x, € [a,b] and f(t) — fn(tn) > €, for 
every n EN. 

(c) There exists a subsequence (%p,) of (Zn) that converges to a point x* € [a, b]. 

(d) For each n EN, f(2n,) — fn(tn,) > f(tn,) — fng(2n,) = € for all k > n. 
(see Exercise 11(b)). 

(e) For each n EN, f(x*) — fn(a*) > € (see Exercise 9 on page 74). Therefore, 
(fn) does not converge pointwise to f. 


8.3 POWER SERIES 


We will now continue our study of infinite series of functions; however, in this section 
we will examine the infinite series of power functions, that is, functions that have the 
form a,(x —c)". Such a series is said to be a power series. A given power series may 
converge for some values of x and it may diverge for other values. In this section will 
present tools that will allow us to identify the particular values for which the power 
series converges. In the next section, we will address the question of representing a 
specific function as an infinite series of power functions. 


Definition 8.3.1. Let (an)7°.) be a sequence of real numbers and let c € R. For each 
n EN, let fr: I > R be defined by f,(x) = an(x% — cc)”, where J is an interval. Then 


fn is called a power series in (x — c) and is also denoted by 
1 


n= 


S- tat) = S- ax(a — c)* = ap + a(x — c) + ao(x — c)? + a3(2 — cc)? +--+. (8.12) 
n=1 k=0 
The numbers ag, a1, a2, ... are called the coefficients of the power series, and the 


number c is referred to as the center of the power series. 


A power series in (x — c) can be viewed as an “infinite” polynomial in (x — c). 


lee) 
When c = 0, a power series takes on the form S> agv*® = ap +a,x+agr?+a3x?+---. 
k=0 


Intervals of Convergence 


Given a power series in (x — c), for what values of x will the series converge? It 
seems likely that the answer to this question depends on the coefficients of the power 
series. Our next theorem will address this question. First, we need to slightly extend 
the definition of the limit superior of a (bounded) sequence given in Section 3.8.1. If 


a sequence (x) is not bounded above, then we shall write limsup x, = ov, in this 
N—- Oo 


specific case. In addition, we shall write x < oo to mean the «x is a real number. 
The term \/z is undefined when x > 0. So whenever a term of the form ¥/x appears 
below, we will be implicitly assuming that k > 1. 
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Theorem 8.3.2. Let S> a,(x —c)* be a power series, £ = limsup ¥/Jag], and let 
k=0 k-+00 


0, if€=a; 
p=45, #£0<£< 00; (8.13) 
coo, iff= 


(1) If p =0, then the series converges only for x = c. 


(2) If0<p< cand |x —c| < p, then > ay(x — c)* converges absolutely. 


(3) If 0 < p< ow and |x —c| > p, then 3 ay(x — c)* diverges. 
k=0 


(4) If p = co, then - az(x — c)* converges absolutely for all real numbers 2. 
k=0 


Moreover, if 0 < ¢ < p, then >> ag(a — c)* converges uniformly on [c — ¢,c + ¢]. 
k=0 


Proof. Let €=limsup ¥/|a,|. First we note that for any x € Randk EN, 
k- 00 


t/Jax(x — )*| = |x — e| ¢/ axl. (8.14) 


If p = 0, then limsup ¢/Ja;z] = 00, by (8.13). Thus, (4/[axl) is not bounded above. If 
k-00 


x # c, then from (8.14) we see that ( \/ [an (x — c)*|) is also not bounded above. By 


Theorem 7.2.12(2), S> az(x—c)* diverges. Thus, the series converges only for x = c. 


Suppose that 0 < p< co. So p= ; By Exercise 10 on page 98, we have 


1 
lim sup ¢/|az(a — c)*| = |x — c|limsup *|a,| = |z — ¢| — 8.15 
SUD | =) |e eh UD |e : (8.15) 
for any x € R. By Corollary 7.2.13, 3° az(x —c)* converges absolutely if |a — c| < p 
k=0 

and diverges if |a —c| > p. 

Suppose that p = oo. Thus, lim sub */|ax| = 0. So, lim sup */ |ax(a — c)*| = 0 for 

k> kh 


all x € R. Thus, the series converges absolutely for all a € R, by ree 7.2.13(1). 
Let0<¢ <p. If xe [e—¢,c4+ , then |x —c| < ¢ and |a;,(a — c)*| < |ax|C*, 


: : k k ¢ 1 
for all k € N. Moreover, as in (8.15), we have that peep AC ag Ge oe Hence, 


by Corollary 7.2.13(1), 3° |ag| ¢* converges. Theorem 8.1.12 implies that the series 
k=0 


y> ax(x — c)* converges uniformly on the closed interval [c — ¢,c + ¢]. 
k=0 


'Replace . with 0, if p = co 
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The value p, defined in (8.13), is called the radius of convergence. If p 4 0, 
then by Theorem 8.3.2 the set of points x such that > az(x — c)* converges is an 


interval. This interval, called the interval of piwversenek is determined as follows: 


1. If 0 < p < ow, then the series converges absolutely for all x in the open interval 
(c — p,c + p), and diverges for all x outside the closed interval [c — p,c+ p]. In 
addition, the series converges uniformly on any closed subinterval of (c— p,c+p). 
To identify the interval of convergence, one must first check the endpoints « = c—p 
and x = c+, separately, to determine convergence or divergence. If any endpoint 
converges, then the interval of convergence is obtained by adding such an endpoint 
to the interval (c — p,c + p). If no endpoint converges, then (c — p,c + p) is the 
interval of convergence. 

2. If p = o, then the series converges for all « € R; in this case, (—oo, +00) is 
the interval of convergence and oo is the radius of convergence. Furthermore, the 
series converges uniformly on any closed interval [a, }]. 


Remark. When p is a radius of convergence, we will write p > 0 to mean that 
p € Rt orp=oo. If p= ooandce€ R, we will interpret (c— p,c+ p) to be (—oo, ov). 


When applying Theorem 8.3.2, the value @ = limsup ¥/|a;| may be difficult to 
k- 00 
evaluate. In this case, Corollaries 3.8.18 and 3.8.5 offer another way to find @. 


Theorem 8.3.3. Let S> ax(a — c)* be a power series where az 4 0 for all k > 0. If 
k=0 


is either a real number or infinity, then lim sup ¥/Ja,] = lim |“ 
k- 00 k-0o0 


ak 


Qk+1 


lim 
k-o0o 


Example 8.3.4. Consider the power series be * (a — 3)*. By Corollary 3.2.10 and 


some algebra, we have that 


an m Vk= 5, 


lim sup ¢ 5 ! 
ee 


k-> oo 


Qk 


and thus, by Theorem 8.3.2, the radius of convergence of the power series is 2. Alter- 
natively, we can apply Theorem 8.3.3 to obtain the same result, namely, 


ko] Be | boo DRL & 2” 


In any case, we see that 2 is the radius of convergence and the power series converges 
for all x in the interval (1,5). For « = 1 and x = 5, we obtain the respective series 


y(-1 )*k and ys 


k=0 


which both diverge. Therefore, (1,5) is the interval of convergence. 
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loc) 
Let 3° az(x —c)* be a power series with interval of convergence J. One can now 
k=0 
define a function f: J > R by 


co 


f(x) = S/ ag(x — c)* = a9 + a(x — c) + ao(a —c)* +--- 
k=0 


Can f be differentiated term by term, as a polynomial can be differentiated? 
[oe) 

Theorem 8.3.5. Let S> a,(a—c)* be a power series with radius of convergence p > 0, 
k=0 


let I be the interval (c—p,c+p), and let f: I > R be defined by f(x) = . a;,(x—c)*. 


Then f is continuous and differentiable on J. Moreover, for all x € J, 
f= S- kax(x — c)*-1 (8.16) 
k=1 
and the power series > ka,(ax — c)*~1 also has radius of convergence equal to p. 
k=1 
Proof. As p > 0, (8.13) implies that @ = lim sup {/|a,| is a real number. Theorem 3.8.6 
k—+00 
and Corollary 3.2.10 imply that 


lim sup */|ka,| = lim Wk - lim sup (/ax| = é. 
k-00 k—-00 k—->00 


loc) 
Thus, >> kaz(x — c)*-! also has p as its radius of convergence (see Exercise 5). 
k=1 


Let I = (c— p,c+ p), and let f: I > R be defined by f(x) = S ax(x — c)*. To 


prove (8.16), let « € J and let [a,b] C I be such that x € (a,b). Theorem 8.3.2 implies 


that 3 az(a—c)* and 3 kax(x2 —c)*~! converge uniformly on [a,b]. Corollary 8.2.7 
k=0 k=1 


implies that f’(7) = 3, ka,(x —c)*~? for all x € [a,b]. Thus, f is differentiable on I 


and hence, is continuous on I by Corollary 5.1.3. 


Consider the power series in Example 8.3.4, with radius of convergence 2 and 
interval of convergence (1,5). Using this power series, we define f: (1,5) > R by 


ie 1 2 re. 4 ote 
F(a) = D0 ala - 3)" =0+ 5(2 3) + ale 3) t og (2 3)° + ale 3)* 4 


Theorem 8.3.5 implies that f’: (1,5) — R exists and is equal to the power series, 
with radius of convergence 2, given by 


f(a) =S (ea) = + Sle 3) + 
k=1 


228 @ Real Analysis: With Proof Strategies 


Since the power series (8.17) has radius of convergence 2, Theorem 8.3.5 implies that 
f”: (1,5) > R exists and is equal to the following power series 


f" (a) = k ee ze _ Bie 2 
k=2 


that also has radius of convergence 2. One can repeat Theorem 8.3.5 to obtain, for 
any n € N, a power series which is equal to f), on the interval (1,5), where tide: 
is the nth derivative of f. The following theorem also follows by repeatedly applying 
Theorem 8.3.5. 


Theorem 8.3.6. Let S> a,(a—c)* be a power series with radius of convergence p > 0. 
k=0 


Let I be the open interval (c—p,c+p), and define f: J > R by f(x) = 3 a;,(x—c)*. 


Then, for all n € N, the nth derivative f(™: I 4 R exists and satisfies 
fc =D SDs Dag or 


= nla, + 3 k(k=1) (kan + 1agle= oe), 
k=n+1 


with radius of convergence p. So f\”)(c) = nlan, for all n = 0,1,2,.... 


Remark. The term by term derivative of a power series > a,(a —c)* will have its 


starting value changed to m-+1 only if the original power sane starts with a constant 


term. For example, by differentiating 3 a,(a —c)*, we obtain 3 apk(a —c)*-1; but, 
k=0 k=1 


after differentiating 3 a;(x — c)**1, we obtain S an(k + 1)(2—c)*. 
k=0 k=0 

We say that a function f is infinitely differentiable at a point x, if for all 
n €N, the nth derivative f(” (x) exists. If f is infinitely differentiable at all points in 
an open interval J, then f is said to be infinitely differentiable on J. Theorem 8.3.6 
shows that a function that is defined by a power series, with radius of convergence 
p > 0, is infinitely differentiable on an open interval. 

Theorem 8.3.6 implies a uniqueness result concerning power series. Suppose that 
two power series have the same center and interval of convergence. Our next corollary 
shows that if these power series have equal values on this common interval, then they 
must be the same power series. 


Corollary 8.3.7. Let 3 az(x — c)* and > b.(x — c)* be power series with radius 
k=0 k=0 


of convergence p > 0. Suppose that S* ax(a — c)* = D> by (x — c)* for all x in the 
k=0 k=0 
interval (c — p,c+ p). Then ax = bx for all k > 0. 
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Proof. Let I = (c— p,c+ pe). Assume that 3 ay(x —c)F¥ = S by(x — c)* for all x 
k=0 k=0 


in I. Let f: I > R be defined by f(x) = 3 ax(x —c)*. Thus, f(x) = S b.(x — c)* 
k=0 


for all x € I, as well. So, f(c) = ao = bo (see (8.12)). Theorem 8.3.6 implies that 
IMO = nian = nb, and so an = by, for all n > 1. Hence, ay, = by for all k > 0. 


Using Corollaries 8.2.2, 8.2.5, and Definition 6.2.12, the proof of Theorem 8.3.5 
can be adapted to prove the next theorem. 


Theorem 8.3.8. Let >> a;,(a2—c)* be power series with radius of convergence p > 0. 
k=0 
Let I be the interval (c—p,c+p), and let f: I > R be defined by f(x) = SD az(x—c)*. 
k=0 
Then f has an antiderivative on J. Moreover, for all x € J, 


ee ye cht} 


[oe 
ak k+1 . 3 ere 
and Pa pay (4 — c)"*" has p as its radius of convergence. 


Exercises 8.3 


[oe 
1. Show that the power series >> = converges for all x. 
k=0 — 


2. Find the radius of convergence of the power series: }> (3 + (—1)*)¥a*. 
k=0 


3. Find the interval of convergence of the power series: (a) p> Kia’ s*(b) So a. 
= k=0 


4. Suppose that for all z € R, bs az(x — c)* converges if and only if Da b, (a — c)* 


converges. Show that ae aeuP y lag] = ae a eup VALE 


*5. Let c € R and suppose that ¢ = lim sup ¥/|a,| is a real number. 
k- oo 


(a) For all x € R, show that hoy sup ee |ag| la — ¢|* + = |a—e 8. 
[Note: If « 4c, then (a — on -@&s =! . See Corollary 3.2.9.] 


(b) Conclude that 3 ax(x — c)* and s kax(x — c)*~* have the same radius of 
k k 


convergence. [See the proof of Theorem 8.3.2.] 


6. Find the radius of convergence of the power series 2 oo (2 — 2)*. 


7. Suppose that s ay,(x—c)* converges for all x € R. Prove that lim sup ¢/Ja;] = 0. 
k=0 k-00 
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[oe 
8. Suppose that S* a,x" has radius of convergence 2. Let n € N. Find the radius 
k=1 


lee) [oe [oe) 2 
of convergence of (a) > afx*, (b) > aga”, (c) XS apa*. 
k=1 k=1 k=1 


9. Prove Theorem 8.3.8. 
10. Suppose that a function f is defined by f(z) = > K(x —1)*. What is the 
domain of f? Find f(°)(1). 7 


11. Find the radius of convergence of > — * and find the sum of this series. Now 
find th f 3° k@=3" and th ee 
nd the sum o p>» 5¢ — and the sum o > ase 


8.4 TAYLOR SERIES 


In the previous section, we started with a power series centered at a point c with a 
radius of convergence p > 0. We then used this power series to define a function f 
and show that it is infinitely differentiable at all the points in an interval determined 
by c and p. In this section, we will attempt to reverse this process; that is, we will 
start with a function that is infinitely differentiable on an open interval and try to 
find a power series that is equal to the function on this interval. 


Definition 8.4.1. Let f be a function that is defined on an open interval I that 


k 


[oe 
contains c € I. Suppose that >> ag(a — c)" is a power series that converges for all 
k=0 


cell f(x) = 3 a,(a—c)* for all 2 € I, then we say that f is represented by the 
k=0 


CO 
power series )> a(x —c)*. We also say that f has a power series representation. 
k=0 


Let f be a function that is infinitely differentiable at a point c. We would like to 


find a power series )> a,(x—c)* such that f(x) = 0 az(x—c)* for all x in some open 
k=0 


[oe) 
interval I containing c. How can one find such a power series? If f(2) = > ax(x—c)* 


for all x in J, then Theorem 8.3.6 implies that f(c) = kla, and thus, az, = ro 
for all k > 0. Hence, we have found the only candidate; that is, if f has a power series 


: 3 F fF) (c) k 
representation, then this power series must be >> “{7“(x£ — c)". 
k=00 


Definition 8.4.2. Let f be a function that is infinitely differentiable on an open 
interval J containing c. The Taylor Series for f centered at c is 


oo £(k)(¢ / Wo B)(¢ 


where the coefficients are called the Taylor coefficients. When c = 0, the Taylor 
series is called the Maclaurin Series for f. 
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As noted above, if a function f has a power series representation centered at c, 
then that power series representation must be the Taylor series for f. On the other 
hand, there are infinitely differentiable functions f that do not have a power series 
representation and thus, the Taylor series for f will not be equal to the function f (see 
Example 8.4.11). In order to ensure that a function is equal to its Taylor series, the 
function must satisfy some additional conditions. Before identifying these conditions, 
we need to revisit a topic discussed in Section 5.3. 

Let f: J > R be infinitely differentiable on an open interval J containing c. For 
allnée Nanda e J, let 


Pe) = 3 u 7 (2 — c)* and R,(x) = f(x) — Pa(z) (8.18) 


where P, is called the nth Taylor polynomial of f at c, and R, is said to be the 
remainder term. The remainder term measures the difference between f(x) and 
P,(x). We can now present a condition, which when satisfied, ensures that a function 
is equal to its Taylor series. 


Theorem 8.4.3. Let f: J > R be infinitely differentiable on an open interval I 
containing c. Let x € J. Then f(x) = 3 L°O (2 —c)* if and only if lim 7) =U, 


Proof. For each n € N, note that P,(x) is the nth partial sum function for the Taylor 
series 5° LPO (zo) (see Definition 8.1.1). Thus, for all x € I, we have the following 
ko 


equivalences: 


a= 3 u kl (2—c)* iff f(z) = im Pt) by Definition 7.1.1 


iff jim (f(x) — P,(x)) =0 by Theorem 3.1.10 


iff Jim A, (a) =O by (8.18). 


Theorem 8.4.3 shows that a function f equals its the Taylor series on an interval I 
precisely when the sequence of remainder terms (R,,(x)) converges to 0, for all x € J. 
However, when trying to prove that the sequence of remainder terms converges to 0, 
the definition of R,(a) in (8.18) is not very useful. Fortunately, there is another way 
to evaluate this remainder term. Taylor’s Theorem 5.3.2 implies that for all n € N, 
there is a point d between c and x such that 


(n+1) 
F(@) = Pala) + ie —c)"", (8.19) 
Since R,,(x) = f(x) — P,(x), Equation (8.19) implies that 
(n+1) 
Rls) = ae er ae (8.20) 


We now present a result which can be used to show that the remainder terms will 
converge to 0. 
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Corollary 8.4.4. Let f: J — R be infinitely differentiable on an open interval I 
containing c. If there is an M > 0 such that Fig) < M forallt € JandallneEN, 
then the Taylor series for f, centered at c, is equal to f at all points x € J. 


Proof. Let M be such that Fro| < M for all ¢ € J and all n € N. Thus, for all 
x €lIandallne€N, there is adc€ J such that (see equation (8.20)) 


(n+1) n 
\f @l = os geal ea ae 


0< |Fl@)l= Ear ~ (n+)! 


(8.21) 


Exercise 16 on page 79 implies that lim Mp = 0, for all x € I. So lim Ae p= 
0 for every x € I by (8.21). Therefore, by Theorem 8.4.3, the Taylor series for f, 


centered at c, is equal to f at all points x € I. 


Example 8.4.5. Let f: R — R be defined by f(x) = e”, the natural exponential 
function. We now apply Corollary 8.4.4 to show that f equals its veceunn series for 


all z € R. Since f((0) = 1 for all integers n > 0, we see that Dae 7, is the Maclaurin 


series for f. By applying Exercise 12 on page 204, one can  yenily that the series 
converges for all x. Let m € N and M = e™ > 0. Since f(™(t) = e' < M for all t in 


the interval (—00,m). Corollary 8.4.4 implies that e” = Ss w for all x € (—oo, m). 


lee) 

Since this holds for any natural number m, it follows that e* = >> 7 for all real 
k=0 — 

numbers x. Substituting « = 1 into this Maclaurin series representation, we see that 


oy a ee ee 
= 2g at ga a 


In Examples 8.4.6-8.4.10 below, by applying differentiation, integration, substi- 
tution, or algebra on a given Taylor series representation, we obtain the Taylor series 
representation of a new function without having to compute its Taylor coefficients. 


Example 8.4.6. Let f: (—1,1) — R be defined 2 oh = x. Theorem 7.1.11 


implies that a = ¥* a* for all x € (—1,1). Thus, -. = equals its Maclaurin series for 
k=0 
all x € (—1,1). Since f’(x) = aap a Theorem 8.3.5 implies that = = = =e kek-} 


for all « € (—1,1). Thus, 3 ka*-} is the Maclaurin series representation of the 


function ia As fo f =In(1—2), we have that [> f = 34 


aa (= 1,1), by 


Theorem 8.3.8. Thus, Pe #**" is the Maclaurin series representation of the function 


In(1 — x) on the hana ha 1 ay 
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Example 8.4.7. In Example 8.4.5, it shown that e*? = >> = for all real numbers z. 
k=0 — 
Since —2? € R for all x € R, we see that 


x*, for allz eR. 


oy 
8, 
T 
Me 
at 
18 
pat) 
T 
Me 
— 
aL 
= 


k=0 k=0 
£0 k 
Thus, ze~™ = > eye, for alla ER 
k=0 


Example 8.4.8. In Example 8.4.6, it shown that ;— = => r* for all x € (—1,1). 


Observe that 1 — (—z) = 1+ a. Since -1 < -x < 1Lif mee only if -1 <a <1, we 
see that 


1 [o,e) [oe) 
= SS (-2)* = S°(-1)*2*, for all x € (—1,1). 
DE 5 k=0 
Hence, 773 = 3 (—1)¥a**!, for all « € (—1,1). 


Example 8.4.9. In Example 8.4.6, it shown that — = s x* for all x € (—1,1). 
k=0 


Since —1 < x? < 1 if and only if —1 < x < 1, we see that 


1 [oe) 
ae Soa for all x € (—1,1). 
—f2 = 


» ee = © o**1 for all « € (-1,1). 
k=0 


aaa 8.4.10. In Example 8.4.8, we obtained the Maclaurin series for the function 
f(x) = gap, that is, f(x) = >> (-1)*2* for all x € (—1,1). Since fT f = n(1 + 2), 


we see that 


(=I) at 
In(h-e ay = S- ae” +? for all x € (—1,1). 

We end this section by identifying an infinitely differentiable function which is 
not equal to its Taylor Series. So not every infinitely differentiable function has a 
Taylor series representation. 


Example 8.4.11. Let f: R — R be defined by 


eV/@" if x £0; 
re) = {5 PE: 


This function is clearly infinitely differentiable at x whenever x 4 0. Moreover, it is 
infinitely differentiable at 0. For example, let us evaluate f’(0): 


f'(0) = lim f(x) — f) ie gale 


= 0 
x0 x —O0 z>0 6 
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where the latter equality follows from Example 5.2.14 on page 142. In a similar 
manner, using Exercise 17 on page 147, one can show that f)(0) = 0 for all k EN. 


Thus, S f(0)a* = 0 for all x, which is not equal to f(a) whenever x F 0. 
k=0 


Exercises 8.4 


1. 


. Show that sin(x) = ye 


As shown in Example 8.4.5, e” = s = for all real numbers x. Show that the 
k=0 


term by term derivative of . ee is equal to 


x 
kl: 
k=0 k=0 


52k +1, for allx ER. 


Gee 


3. Using Exercise 2, find the Maclaurin series for x cos(zx). 


11. 


12. 


13. 


. Using Maclaurin series for 


. Express the integral fh e~* dx as an infinite series. 


ett 


. Using the Maclaurin series for In(1 — x) in Example 8.4.6, evaluate 2 tr 


. Find the sum of the series s ka® and 5 k?x*, when z is in (—1, 1). 


k=1 k=1 


. Find the sum of the series 3> 2*+!, when « is in (—1,1). Now find the sum of 


k=0 
the series 2 8 ROOF ane 


i in Example 8.4.7, find the Maclaurin series 


y and its interval of convergence. 


for eens 


. Find the Taylor series for e” centered at 1. 


10. 


Let p € Rt andc ER. Let I = (c—p,c+ pp) and f: I — R. Suppose that 
f(x) = X a,(x — c)* if and only if 2 € I. Prove that lim sup */Ja,] = + 


k=0 k- oo 


pail the Maclaurin series for in Example 8.4.8, find the Maclaurin series 


ne 
for + centered at 1 and over the interval (0, 2). 


Let f(z) = >> apx* for all x such that |z| < p, were p € R*. Suppose that 
k=0 
f(x) = f(—2x) whenever |z| < p. Prove that a, =0 for every odd integer k. 


Suppose that the power series >= ap(x — c)* has radius of convergence p > 0. 


Let I be the interval J = (c— me Cc si p), and let [a, 6] C J. Explain why the power 
series converges uniformly on |a, )]. 


APPENDIX A 


Proof of the Composition 
Theorem 


Theorem 6.2.14 (Composition Theorem). If f: [a,b] — [c, d] is Riemann integrable 
and g: [c,d] > R is continuous, then (go f): [a,b] > R is Riemann integrable. 


Before proving this theorem, we present some technical notation and two technical 
lemmas that will be used in the proof. 


Definition A.1. Let f: [a,b] + R be bounded and P = {20,21,%2,...,%n} bea 
partition of [a,b]. Let M; = M;(f) and m; = m,(f) for i=1,...,n. Given 6 > 0, let 


As = {i: M; —m, < 6} and Bs = {1 : M; — m; > 6}. (A.1) 


Let As and Bs be as defined in Definition A.1. Then 


Ap Bea 4 273) yout and As Bs = ©. (A.2) 
Lemma A.2. Suppose that f: [a,b] — R is bounded. For every 6 > 0, whenever 
P = {20,%,22,...,%pn} is a partition of [a,b] such that 


then 5° Az; < 6 where B; is as defined in (A.1). 

1€ Bs 
Proof. Let f, 6, and P be as stated. Assume that P satisfies (A.3). Let As and Bs; 
be as in Definition A.1. From (A.2) and (A.3), we see that 


UGE, P) _ ET; P) = s(M, _ m,)Ax; = .3 (M; _ m)Ax; + S- (M; _ m,)Ax; < 6°. 


i=1 iE As iE Bs 


Therefore, > (M; — m;)Ax; < 67. Since 6 < M; — m; for all i € Bs, we conclude 
iE Bs 
that 3> 6Ax;< >> (M;—m;)Az; < 6. Thus, > Az; < 6. 


1€ Bs 1€ Bs iE Bs 
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Lemma A.3. Let f: [a,b] > [c,d], g: [c,d] > R, <’ > 0, and 6 > 0. Suppose that 


for all y, y’ € [c,d], if |y — y'| < 6, then |g(y) — g(y’)| < &. (A.4) 
Then for any partition P = {zo, 271, %2,...,2n} of [a,b], we have that 
Yo (Mi(ge f) — mi(go f)) Ani < e(b — a), 
i€ As 


where As is as in Definition A.1. 


Proof. Let f: [a,b] > [c,d], g: [c,d] > R, & > 0, and 6 > 0. Assume (A.4). Let 
P = {x0,21,...,2n} be a partition of [a,b]. Let M? = Mi(go f), m? = mi(go f), 
M; = M,(f), and m; = m;,(f) for each i = 1,2,...,n. Let As = {i : M; — m; < 6}. 


Claim. M? — m? < é’ for alli € Ay. 


Proof of Claim. Let i € As and let S = {g(f(x)) : x © [xi-1,2;]}. Observe that 
M? = sup(S) and m? = inf(S). Let x,2’ € [x_1,2;|. Hence, m < f(x) < M; 
and m; < f(2’) < Mj. So |f(x) — f(z')| < M; — mj. Since i € As, we see that 
| f(z) — f(a’)| < 6. Thus, (A.4) implies that |g(f(x)) — g(f(2’))| < e’. Therefore, 
M? — m? < é’ by Theorem 2.3.18. 


The Claim implies that 


S> (MP —m)Ax; < oa e'Ag; = ¢! S- Ag; < e'(b—a), 


1€As5 1€E As 1E As 


where the last inequality holds because 5*> Ag; < So Ax; = b-a. 
ic As i=1 


A Proof of the Composition Theorem 


Proof of Theorem 6.2.14. Let f: [a,b] — [c,d] be integrable and g: [c,d] + R be 
continuous. To prove that (go f): [a,b] — R is integrable on [a,b], we shall apply 
Theorem 6.1.18. Let ¢ > 0. Since g is continuous on [c,d], we see that g is bounded. 
Let K = sup{|g(x)|: x € [c,d]}. Since a < band K > 0, there is an e’ > 0 such that 


e'(b—a+2K) <e. (A.5) 


By Theorem 4.5.5, g is uniformly continuous on |c, d]. So, there is a 6 > 0 such that 
6<e' and 


for all y,y’ € [c,d], if |y — y’| < 6, then |g(y) — g(y’)| < &. (A.6) 
Since f is Riemann integrable on [a, 6], let P = {x0,%1,%2,...,%n} be a partition 
such that 
Let As = {1 : M; —m, < 6} and Bs = {1 : M; — m; > 6} where M; = M;(f) and 


m; = mj(f) for each i = 1,...,n. Now, let M? = M;(gof) and m? = m,(go f), for all 


Proof of the Composition Theorem MH 237 


i=1,...,n. Since K = sup{|g(x)|: x € [c,d]}, it follows that —K <m?< MP <K 


and thus, 


Me ie < 26 for t= Noes 


eG (A.8) 


We now show that U(go f, P) — L(go f, P) < € as follows: 


U(go f,P) — L(go f, P) 
= S7(M? — m?)Ani 
i=1 


Do (MP — m)) Aa; + >) (MP — m2) Az; 


iE As iE Bs 
<e'(b—a)+ S- 2K Ax; 
iE Bs 
=e'(b—a)+2K S- Ag; 
1EBs 
< e'(b—a) + 2K6 
< (b—a)+2Ke' 


e'(b—a+2K) <e 


Therefore, go f is Riemann integrable on [a,b], by Theorem 6.1.18. 


by Lemma 6.1.7 

by (A.2) 

by (A.6), Lemma A.3, (A.8) 
by algebra 


by (A.7) and Lemma A.2 
since 6 < e’ 


by algebra and (A.5). 


Taylor & Francis 
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APPENDIX B 


Topology on the Real 
Numbers 


In this appendix, we will explore certain topological properties that concern certain 
subsets of R. This involves extending the basic properties possessed by open intervals 
to a broader collection of sets of real numbers. We will do the same for closed intervals. 
These generalized properties are called open, closed, and compact. 


B.1 OPEN AND CLOSED SETS 


We begin by considering some relationships that hold between an individual real 
number and a set of real numbers. Recall that a neighborhood of a point « € R is an 
open interval of the form (x —¢,x+¢), centered at x, for some ¢ > 0. To simplify the 
notation, we will denote the neighborhood (x — ¢,x + €) by U2. We will also write 
U* to denote a neighborhood of x when it is not important to identify e. 

Now let x € R and let S CR. Think of x as being a point on a map and the set 
S as being a region on the map. The next definition gives meaning to the expressions 
“gz is inside of S” and “x on the boundary of S.” 


Definition B.1.1. Let S be a set of real numbers. 


1. The point « € R is an interior point of S if there is a neighborhood U® such 
that U® C S. The set of all interior points of S is denoted by int(S). 

2. The point x is a boundary point of S if every neighborhood U* contains points 
in S and points not in S. The set of all boundary points of S is denoted by bd(S). 


If x is an interior point of S, then « € S. Whereas, a boundary point of S may 
be or may not be a member of S. 


Example. We identify the sets int(.S) and bd(S) for the following five sets: 


1. S = (0,3): int(S) = (0,3) and bd(S) = {0}. 
2. S=N: int(S) = @ and bd(S) =N. 
3. S = {4:n€N}: int(S) = @ and bd(S) = SU {0}. 
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4, $=Q: int(S) = @ and bd(S) = R. 
5. S={qeEQ:0<q< 3}: int(S) = @ and bd(S) = [0,3]. 

The concept of a topology on the set of real numbers involves certain subsets of 
R that are decreed to be “open sets.” An open set of real numbers is an important 
concept in mathematics. The collection of all open sets is called a topology. Many 


topics that we previously studied (such as convergence and continuity) can be defined 
entirely in terms of open sets. 


Definition B.1.2. Let S be a set of real numbers. 


1. S is an open set if int(S) = S, that is, for every x € S' there is a neighborhood 
U®* such that U® C S. 


2. S is a closed set if R \ S is an open set. 


One can view a set S as being open, if every point in S is completely surrounded 
only by points in S. 


Comment. An open set S contains none of the boundary points of S. A closed set 
S contains all of the boundary points of S. 


Comment. The sets @ and R are both open and closed sets. These are the only 
subsets of R that have both of these properties. 


We now show that an open interval is an open set. 


Theorem B.1.3. Let a,b € R be such that a < b. Then the open interval (a, b) is 
an open set. 


Proof. Let x € (a,b), and so, a < x < b. Let e = min{x—a, b—}. We will prove that 
U® C (a,b). Let y € UZ be arbitrary. Thus, |y — 2| < ¢. Since ¢ = min{z — a,b — =z}, 
we see that |y — z| < s—aand |y—2| < b—«. Thus, 


e-y<x-aandy—“<b—-«z. 


Hence, a < y and y < b. Therefore, a < y < 6. 


Thus, in particular, every neighborhood is an open set. 

Theorem B.1.4. Let a € R. The open intervals (—oo, a) and (a, +00) are open sets. 
There is a simple relationship between open sets and closed sets. 

Theorem B.1.5. Let S be a set of real numbers. 


(a) The set S is open if and only if R \ S is closed. 
(b) The set S is closed if and only if R \ S is open. 


We will now explore how the set operations affect open and closed sets. For 
example, is the union or intersection of two open sets also open? What about the 
union or intersection of any family of open sets? We note that the operations UJ F 
and (|F are defined in Section 1.2.5. 
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Theorem B.1.6. Let F be a family of open sets. 


(a) Then U F is an open set. 
(b) If F is finite and nonempty, then ()F is an open set. 


Proof. Let F be a family of open sets. 
(a). Let c € UF. Thus, there is an O € F such that x € O. As O is an open set, 
there is a U® such that U® C O. Since O C UF, it follows that U* C UF. 
(b). Let F = {O1,...,On} where n € N. Let x € (.\F. Thus, x € O; for all 
O; € F. As every O; is an open set, there is an ¢; > 0 such that Us © O;, Let 
= Min{ Fins, Say > 0. AS UP CUP CO; tor all Oye F, we have UP Cif. 


Corollary B.1.7. Let G be a nonempty family of closed sets. 


(a) Then ()G is a closed set. 
(b) If G is finite, then UG is a closed set. 


Proof. Let G be a nonempty family of closed sets. 
(a). By Theorem ??(2), we have that (A) R\QG = U{R\ B: B € G}. Since 
R \ B is an open set for each B € G, Theorem B.1.6(a) and (A) imply that R\ NG 
is an open set. Hence, (|G is a closed set. 
(b). See Exercise 6. Hint: Use Theorem B.1.6(b) and Theorem 1.2.11(1). 


Theorem B.1.8. Let a,b € R. Then (—oo, a) U (b, +00) is an open set. 


Proof. By Theorem B.1.4, (a,-+oo) and (—oo, a) are open sets. Thus, Theorem B.1.6 
implies that (—oo, a) U (b, +00) is an open set. 


Theorem B.1.9. Let a,b € R be such that a < b. Then [a, }] is a closed set. 


Proof. Since R \ [a, 6] = (—oo, a) U (b, +00), Theorem B.1.8 tells us that R \ [a, 5] is 
an open set. Therefore, [a,b] is a closed set. 


B.1.1 Continuity Revisited 


For x € R, recall that U® denotes an open interval (a — p,x + p) for some p > 0. 
As alluded to earlier, the continuity concept can be defined entirely in terms of open 
sets. This is established by the following two theorems. 


Theorem B.1.10. Let f: D— Randce D. The following are equivalent: 

(1) f is continuous at c. 

(2) For every US there exists a US such that f[U°N D] C USO, 

(3) For every US there exists a U° such that USN D C fo Ufo]. 

Proof. Let f: D — R be a function and let c € D. Items (2) and (3) are equivalent 
by Exercise 13 on page 22. So we will show that (1) and (2) are equivalent. 


(1) > (2). Let f be continuous at ¢ and let U/( be neighborhood of f(c). Since f is 
continuous at c, there is a U° such that f[U°M D] C US (see Figure 4.1). 
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(2) = (1). Assume that (a) for each US there is a US such that f[USN D] C UFO). 
Let ¢ > 0. S0 UL isa neighborhood. By (A), there is a Us so that f[UsSQ D] C uf, 
Thus, for all x € D, if |x — cl < 6, then | f(x) — f(c)| < ¢. So f is continuous at c. 


Theorem B.1.11. A every function f: D —> R, the following are equivalent: 


(1) f is continuous. 
(2) For every open set V there exists an open set U such that UM D = f71[V]. 
(3) For every closed set B there exists a closed set A such that AN D= f~1![B]. 


Proof. Let f: D — R be a function. 
(1) > (2). Assume that f is continuous. Let V be an open set. Define the set F by 


U° €F iffce fo '[V] and USN DC fo [UF] for some UL C V. (B.1) 


By Theorem B.1.6(a), U = UF is an open set. We now prove that UN D = f-1[V]. 
(C). Let x € UND. So x € USND for some US € F. Thus, by (B.1), we have that 
t€UCnDC f-[UL] where USO CV. Since f- [UFO] C f-1[V] (see Exercise 9 
on page 22), we conclude that x € f~![V]. Hence, UN D C f-"[V]. 
(>). Let c € f-"[V]. So c € D and f(c) € V. Since V is an open set, there is 
a neighborhood US C V. Theorem B.1.10(3) implies there exists a U° such that 
USD C f- [UF]. Thus, by (B.1), US € F. Soc EC UND. Hence, f-'[V] C UND. 


(2) + (1). Assume (2). Let ¢ € D and let US be a neighborhood of f(c). By (2), 
there is an open set U such that UM D = f-'[Uf]. Since f(c) € UFO, we infer 
that c€ UN D. Since U is an open set, there is a US such that US C U. So 


UNDCUND= f[UrO). 


Thus, by Theorem B.1.10(3), f is continuous at c. Therefore, f is continuous. 
For the equivalence of items (2) and (3), see Exercise 5. 


B.1.2 Accumulation Points Revisited 


Let S CR. Recalling Definition 3.5.2, a point x € R is an accumulation point of S if 
every neighborhood of x contains an infinite number of points from S. That is, for all 
neighborhoods U”, the set SMU” is infinite. A point x € R is called an isolated point 
of S if « € S and x is not an accumulation point of S. The set of all accumulation 
points of S is denoted by S$". 


Comment. A point x is not an accumulation point of S if there is a neighborhood 
U* such that the set SU” is finite. 


Definition B.1.12. Let S C R. The closure of the set S is defined by cl(S) = SUS". 
Lemma B.1.13. Let S C R. Then cl(S) is a closed set. 


Lemma B.1.14. Let S C R, x € R and x ¢ S. Suppose that every neighborhood 
of x contains a point from S. Then every neighborhood of x must contain an infinite 
number of points from S. 
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Proof. Let S CR, x € Rand x ¢ S. Suppose that 
every neighborhood of x contains a point from S. (A) 


We will prove that every neighborhood of « must contain an infinite number of points 
from S. Suppose, for a contradiction, that some neighborhood of x, say U? (where 
€ > 0), contains only a finite number of points from S. So, UP NS = {x1,22,..., Un} 
where n € N. Let e* = min{|z — |, |x — r2|,...,|2 —an|} > 0. Because x ¢ S, 
it follows that e«* > 0. Now, since e* < ¢€ it follows that UX C UZ; and hence, 
UZ0S CU?FNS. Thus, U29S C {x1,%2,...,¢n}. In addition, since e* < |x — a; 
for each i < n, it follows that 7; ¢ U% for alli <n. So, URNS = @. Therefore, U2. 
is a neighborhood of x which contains no points from S. This contradicts (A). 


Theorem B.1.15. Let S C R. Then S is closed if and only if S contains all of its 
accumulation points. 


Proof. Let S be a set of real numbers. 


(=). Assume that S is closed; thus, R\S is open. Let x be an accumulation point of S. 
Suppose, for a contradiction, that x ¢ S. Since R\S' is open, there is a neighborhood 
U* such that U® contains no points from S. However, since x is an accumulation 
point of S, U* contains an infinite number of points from S. 


(<=) Assume that S contains all of its accumulation points. Let « € R \.S. We must 
show that there exists a neighborhood of « which contains no points in S$. Suppose, 
for a contradiction, that every neighborhood of x contains some points from S. Thus, 
by Lemma B.1.14, every neighborhood of x contains an infinite number of points 
from S. So x is an accumulation point of S. Hence, by assumption, x € S. 


Theorem B.1.15 and Theorem 3.5.4 imply the following useful equivalence. 


Theorem B.1.16. Let S C R. Then S closed if and only if for every sequence (s,,) 
such that s, € S for alln EN, if lim Sn = c¢, thence S. 


Exercises B.1 


1. Let A be open and B be closed. Prover that A \ B is open and B \ A is closed. 
2. Let S CR. Prove that bd(S$) is a closed set. 

3. Let S C Rand let F = {A: 5 C Aand A is closed}. Prove that cl(S) =(\F. 
4 


. Let f: D> R be a function where D C R. Let A C R and B C R. Show that 
(R\ A)N D = f-1[B] if and only if AN D = f-![R \ B). 


*5. Using Exercise 4, complete the proof of Theorem B.1.11 by showing that items 
(2) and (3) are equivalent. 


*6. Prove Corollary B.1.7(b). 
7. Prove Lemma B.1.13. 
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8. Prove Theorem B.1.16. 
9. Let f: D— R be continuous and let a € R. 


(a) Show that there is an open set U such that UM D= {x € D: f(x) > a}. 
Observe that f~'[V] = {x € D: f(x) > a}, where V = {yER: y >a}. 

(b) Show that there is an open set U such that UN D= {x € D: f(x) < a+l1}. 

(c) Show that there is a closed set A such that AN D= {x € D: f(x) > a}. 

(d) Show that there is a open set U such that AN D= {x € D: f(x) # a}. 

(e) Show that there is a closed set A such that AN D= {xe D: f(x) =a}. 


10. Let f: D— Rand g: D> R be continuous. Show that there is an open set U 
such that UN D= {xe D: f(z) # g(x)}. 


B.2 COMPACT SETS 


Closed intervals are very important in real analysis. Theorem 4.4.8 shows that the 
continuous image of a closed interval is a closed interval. Moreover, Theorem 4.5.5 
states that a continuous function on a closed interval is uniformly continuous. These 
theorems may not hold if one replaces “closed interval” with “open interval.” What 
is so special about closed intervals? Closed intervals are compact. As we will see, the 
previously cited theorems hold if we replace “closed interval” with “compact set.” 

Before we can define the concept of a compact set, we need to discuss the notion 
of an open cover. 


Definition B.2.1. Let S CR. An open cover of S is a family F of open sets such 
that S C UF, that is, F covers S. If F’ is a finite subset of F such that S C UF’, 
then F’ is called a finite subcover of S. 


Example. Consider the subset S = [2,7] of R. For each n € N, let O, = (1,7 — +). 
In addition, let O = (6,8). Let F = {O,O 1, O2, O3,...}. Thus, F is an open cover of 
S because S C UF. Let F’ = {O, Oz}. Note that OUO2 = (6,8)U(1,6+ 4) = (1,8). 
Thus, S C UF’ and F’ is a finite subcover of S. 


Example. Consider the subset S = (2,7) of R. For each n € N, let O, be the open 
interval O, = (1,7 — +). Let F = {O, : n € N}. Then § C VU O, and so F is an 
neN 


n 
€ 
open cover of S. But any finite subset of F, say, {On,,On,,---,On,} will not cover 
S, because for m = max{nj1,Nn2,...,nx} 


1 
Om, U Ong U---U On, = (1,7 - =) 


and $ = [2,7) Z (1,7— +). Therefore, F has no finite subcover of S. 


m 


Definition B.2.2 (Compactness). A set S C R is said to be compact if every open 
cover of S has a finite subcover. 


It follows, vacuously, that the empty set is compact. 
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Lemma B.2.3. Suppose that CC R is a nonempty, closed, and bounded set. Then 
a = inf(C) and 6 = sup(C) exist. In addition, a € C and 8 € C. Hence, a = min(C) 
and 8 = max(C). 


Proof. Let CC R be nonempty, closed, and bounded. By the completeness axiom, 
GB = sup(C) exists. We will now show that 6 € C. Suppose, for a contradiction, that 
B € C. Since R \ C is open, there is a neighborhood U® such that U2? AC = @. 
Thus, there is an 2 € U® such that x < 6 and for all y, if x < y < 8, then y ¢ C. 
So, x < 6 and s < z for all s € C. But this contradicts that fact that @ is the least 
upper bound for C. Thus, 6 € C and so 8 = max(C). The argument proving that 
a = inf(C) exists and a = min(C), is similar. 


Theorem B.2.4. Let C C R be closed and bounded. Then C is compact. 


Proof. As @ is compact, let C C R be nonempty, closed, and bounded. Let F be an 
open covering of C. For each x € R, define C, = {z € C:: z < x}. Consider the set 


S = {x €R:F has a finite subcover of C,}. (B.2) 


Claim 1. S is nonempty. 


Proof of Claim 1. By Lemma B.2.3, min(C) = m € C. Thus, C,, = {m}. Since F is 
an open cover of C,, there is an O € F such that m € F’.. Therefore, {O} is a finite 
subcover of C,. Hence, m € S and S 4 @. This completes the proof of Claim 1. 


Claim 2. S is not bounded above. 


Proof of Claim 2. Suppose, for a contradiction, that S is bounded above. Thus, by 
the completeness axiom, 3 = sup(S) exists. Therefore, 


(Vy > 6) (F does not have a finite subcover of Cy) (B.3) 
(Vax < 6) (F does have a finite subcover of C;). (B.4) 


(To see that (B.4) holds, note that if x < y, then C; C Cy.) Since either 6 ¢ C or 
8 € C, we consider these two cases. 


Case 1: 8 ¢ C. Since R \ C is open, there is a U® such that U? NC = @. So, there 
exists x,y € U® such that x < 8 < y and for all z, if x < z < y, then z € U® and 
hence, z ¢ C. Thus, C,; = C,. Now since x < (, (B.4) implies that F has a finite 
subcover of C,. So F has a finite subcover of Cy. As y > §, this contradicts (B.3). 


CASE 2: 6 € C. Since F is an open cover of C, there is an O € F such that 6 € O. 
Since O is an open set, let U? be such that U? C O. So, there exists x,y € U® such 
that x < 6 < y and for all z, if < z<y, then z € O. As x < 8, (B.4) implies that 
F has a finite subcover F’ of C,. Hence, F’ U {O} is a finite subcover of Cy. Since 
y > GB, this contradicts (B.3). This completes the proof of Claim 2. 


Therefore, S is not bounded above. Since C’ is bounded, let « € S' be such that 
z <a for all zg € C. Therefore C = C, and thus, F has a finite subcover of C. 
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Theorem B.2.4 implies that a closed interval is a compact set, as is the union of 
finitely many closed intervals. Thus, we can now present and prove generalizations of 
Theorems 4.4.8 and 4.5.5. 


Theorem B.2.5. If f: D — Ris continuous and D is compact, then f|D] is compact. 


Proof. Let f: D — R be continuous and D be compact. Let G be an open covering 
of f[D]. Let F be defined by 


U € F iff U is open and UN D = f~'[V] for some V € G. (B.5) 
Claim. F is an open covering of D. 


Proof of Claim. Let c € D. Thus, f(c) € f[D]. Since G covers f[D], there isa V € G 
such that f(c) € V. Theorem B.1.11 implies that there is an open set U such that 
Un D= f-1[V]. Thus, U € F and, asc € f~"[V], we see that c € U. Hence, F is 


an open covering of D. This completes the proof of the claim. 


Since D is compact, let F’ = {U,, U2,...,Un} C F be a finite subcover of D. For 
each U; € F’, let V; € G confirm that U; € F according to (B.5), the definition of F. 
Now let G’ = {Vi, Vo,..., Vn} C G. We now prove that G’ covers f[D]. Let y € f[D] 
and let c € D be such that y = f(c). Since F’ covers D, let U; € F’ be such that 
c € U;. Since U; 1 D = f7"[Vj], it follows that c € f—1[V;]. Thus, y = f(c) € Vi. 
Therefore, G’ covers f|D] and f[D] is compact. 


Theorem B.2.6. If f: D — R is continuous and D is compact, then f is uniformly 
continuous on D. 


Proof. Let f: D — R be continuous, D be compact, ¢ > 0, and e* = 5. Define F by 
Us € F iffce Dand flUS ND] CU. (B.6) 


Let c € D. Since f is continuous, Theorem B.1.10(2) implies that there is a US such 
that f[U§ NA D| C ul. Let 6 = £. So d = 26 and c € Uf € F. Thus, F is an 
open covering of D. As D is compact, let F’ = {U;', Us?,...,Us"} © F be a finite 
subcover of D. Let 6 = min{61, 62,...,6n} > 0. Let 2, y € D be such that |x — y| < 0. 
Since F’ covers D, we have that x € Us’ for some Us' € F’. Thus, |x — c| < 4;. Since 


lo, — y| < |; — a] + la — y| < 6, +6 < 6; + 6; = 26;, 


we see that y € U3;. We also have that « € U3;. Thus, by (B.6), we have that 
| f(x) — f(c)| < e* and |f(y) — f(c)| < e*. Hence, 


If(2) — FW) S If) — Fla) + If) — fall < +e" =e. 


Therefore, f is uniformly continuous on D. 


Exercises B.2 


1. Show that each of the following subsets S of R are not compact by finding an 
open cover of S which has no finite subcover. 
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1,8 = 0,3). 3. S = {4: ne N}. 
2.9 =N, 4.S={qeEQ:0<¢< V2}. 
2. Let (s,) be a sequence that converges to c. Show that {c, 51, s2,...} is compact. 


3. Let kK be a compact set and let (s,) be such that s, € K for all n € N. Show 
that (s,) has a subsequence that converges to a point in K. 


4. Let K be a compact set and let S C K bea closed set. Prove that S is compact. 
5. Let K be a compact set and let S be a closed set. Prove that K MS is compact. 
6. Let K be a compact set and let S be an open set. Prove that K \ S is compact. 
*7. Let F be a nonempty set of compact sets. Prove that ()F is compact. 

8. Let F be a finite set of compact sets. Prove that UJ F is compact. 


B.3. THE HEINE-BOREL THEOREM 


In this section we present a theorem that provides an alternative characterization 
of compactness. This theorem was discovered during the time when mathematicians 
were developing a logically sound foundation for real analysis. Much of this work was 
influenced by Cantor’s work in set theory and topology. 


Theorem B.3.1 (Heine-Borel). Let S C R. Then S is compact if and only if S is 
closed and bounded. 


Proof. Let S C R. Theorem B.2.4 shows that if S is closed and bounded, then S' is 
compact. So we just need to prove the converse. To do this, assume that S' is compact. 
We will prove that S' is closed and bounded. 


Claim 1. S is a closed set. 


Proof of Claim 1. To show that S' is closed, we will show that R\ S is an open set. 
So let x € R\ S. For each n EN, the set 


1 
Ut, ={yeR:|e-yl <=} 


is a neighborhood of x. We will prove that there is an n € N such that i/n CR\S. 
For each n € N, let O, be the open set defined by 


1 
On={yeR: |x —y| > \. 
n 
Note that for all m,n EN, 
m <n implies Om C On. (B.7) 


Let F = {O, : n € N}. Since x ¢ S, we see that S C UF. So F is an open cover 
of S. As S is compact, there is a finite F’ C F that covers S. Let F’ = {On,,.--On,} 
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be such that S C UF’. Let n € N be the largest of the natural numbers ny,..., 7. 
Thus, by (B.7), UF’ = On. Hence, S C On. AS On M Ulin = ©, it follows that 
Lin CR\ S. Hence, R \ S is an open set. This completes the proof of Claim 1. 


Claim 2. S is a bounded set. 


Proof. We will show that there is an n € N such that S C (—n,n). For each n €N, 
let O, be the open interval O, = (—n,n). Note that for all m,n €N, 


m <n implies O, © On. (B.8) 


Let F = {O, : n € N}. Since S C R, it follows that S C UF. Since S is compact, 
there is a finite F’ C F that covers S. Let F’ = {On,,...On,}. Let n € N be the 
largest of the natural numbers nj,...,. Thus, UF’ = O,, by (B.8). Hence, S C On; 
that is, S C (—n,n). Thus, S is bounded. This completes the proof of Claim 2. 


Claim 1 and Claim 2 complete our proof of the Heine—Borel Theorem. 


The Heine-Borel Theorem is a topological tool that can be used to produce 
different proofs of several of the theorems previously established in this text. To 
confirm this assertion, we now give a topological proof of Theorem 3.5.3. 


Theorem 3.5.3 (Bolzano—Weierstrass for Sets). Let S C R be infinite. If S is 
bounded, then there is a point x € R such that x is an accumulation of S. 


Proof. Let S be an infinite bounded subset of R. So, there exists a,b € R such that 
SC |a, b]. We shall now show that there is an x € [a, b] which is an accumulation point 
of S. Suppose, for a contradiction, that for all x € [a,b], x is not an accumulation 
point of S. So for all x € [a,b], there exists a neighborhood of x, say U*, such that 
SM U* is finite. Now consider the following set of open sets 


F = {U* : x € [a,b] and SM U® is finite. } 


Hence, F is an open cover of [a,b]. By Theorem B.1.9, [a, b] is a closed and bounded 
subset of R. Thus, the Heine-Borel Theorem implies that [a,b] is compact. As F is 
an open cover of [a, b], let F’ = {U™,U™,...,U} C F be a finite subcover of [a, 6]. 
So 

[a,6) 6 | JF =O" OU U0 U0 


and, because S' C [a, b], we have that 
Se Peer Use Ue 


Therefore, 


§ =(SNU™)U(SNU™)U---U(S AU"). 


Since each SU is finite, we conclude that S' is a finite union of finite sets. Thus, 
S is a finite set. Contradiction. 


The Heine-Borel Theorem also offers a much easier way to show that a set is 
compact. 
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B.3.1 The Finite Intersection Property 


Definition B.3.2. A family of sets K has the finite intersection property if the 
intersection of any nonempty finite subset of K is nonempty. 


We now present a generalization of Theorem 2.5.2, the nested intervals theorem. 


Theorem B.3.3. Let K be a nonempty family of compact sets. If K has the finite 
intersection property, then (\K 4 @. 


Proof. Let K be a nonempty family of compact sets and with the finite intersection 
property. As each K € K is compact, we see that K is closed and bounded by the 
Heine-Borel Theorem B.3.1. So for all K EK, K=R \ K is an open set. Define the 
set F = {Kk : K € K}, which consists of open sets. We see that (a) R\ K = UF, 
by Theorem 1.2.11(2). Let Ko € K be a fixed element of K. So Ko is a compact set. 

We will now prove that ()K 4 @. Suppose, for a contradiction, that (VK = @. 
From (A) we see that R = UF. So Ko C UF. Since Ko is compact, there is a finite 
F' CF such that (») Ko C UF’. Let 


G={Kp}U{K: Ke F'}. 


Since G is finite, G C K, and K has the finite intersection property, there is a y such 
that (W) y € ()G. So, in particular, y € Ko. Thus, by (»), y € K for some K € F’. 
Hence, kK € G and y ¢ K. This contradicts (W). Therefore, ()K 4 2. 


B.3.2 The Cantor Set 


We end this appendix, by establishing the existence of a remarkable set called the 
Cantor set. This set was introduced by Georg Cantor in 1884 and has many curious 
properties, some of which will be discussed here. 

In order to identify the Cantor set, we must first construct a nested set of compact 
sets {K,, : n € N} where K, D Ky4, for all n € N. We being this construction by 
letting Ay = [0,1]. Divide [0,1] into three subintervals of equal length and remove 
the middle open interval (5, $), and let Ky = [1, $] U [§, 1]. Now divide each of the 
intervals that comprise K into three subintervals of equal length and remove all of 
the middle open intervals, to obtain 


1 2 3 6 7 8 
= [0.35] [gaega|Y [gorge] U [ge] 

Continuing in this manner, we obtain Ky, Ko, K3,...,Kn,..., where K, D Ky4, for 
all n € N (see Figure B.1). We make an initial observation: Every K,, is the union of 
2™-! many disjoint closed intervals each of length =. Now let K = {K,:n € N}. 

The Cantor set, which we denote by C, is the set C=()K. 
Since K,, is a finite union of closed intervals, we see that each Ky, is compact. 
Thus, Exercise 7 on page 247 implies that C is compact. Since K is nested, it follows 
that K has the finite intersection property. Thus, the Cantor set C = ()K is nonempty 
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K, ° 1 
Ky ° 1/3 2/3 1 
K; 0 1/9 2/9 1/3 2/3 7/9 8/9 1 
Toh te aes eo 2, pas a 


Figure B.1: Constructing the Cantor Set. 


by Theorem B.3.3. However, the fact that C is nonempty is not surprising, since the 
endpoints that appear in each K,, are all in C; that is, 


1212 
727 2° 9’ Q’ 


7 8 
0,1 go C. 
One might conjecture the following: Every point in C appears in the above countable 
list of endpoints. As we will see, this conjecture is actually false. 

We can now pose a question: Does the Cantor set C contain any intervals? This 
seems unlikely, but to verify that C contains no intervals, we must provide a proof. 
Let « € C and y €C be such that x < y. If (x,y) CC, then (x,y) C K,, for every 
n EN. Let n € N be such that = <y-—wx. Since K, is the union of disjoint closed 
intervals of length ot we cannot have that (x,y) C K,. So C contains no intervals. 
For this reason, C is said to have no interior points. 

We can now pose another question: What is the total length of all the intervals 
removed in the process of constructing C? Note that 


e K» was obtained by removing one interval of length 3 
e k3 was obtained by removing two intervals of length 3, 
e k4 was obtained by removing four intervals of length me 
e Ks was obtained by removing eight intervals of length z- 
Continuing in this way, we see that the total length of the removed intervals is 
1 


1 1 1 OA ONE z 
+2 4+ 4 48 ee = = 3. =]. 
Be Bes Be os clas 25g) 1-3 


Thus, in the construction of the Cantor set, the total length of all of the “middle” 
open intervals removed is 1. Since the Cantor set consists of the points in [0,1] that 
remain after removing all of these intervals, we conclude that the “length” of the 
Cantor set must be 0. We know that a point has zero length. Thus, the Cantor set is 
“small,” because it has the same length as a point. On the other hand, one can show 
that C is uncountable. So the Cantor set is large in the sense of cardinality, but small 
in the sense of length. 

A infinite binary sequence is a function a: N - {0,1}. Thus, we can write a as 
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Figure B.2: Intervals in Z° where a = (1,0,1,0,...). 


(a(1), a(2), a(3),...) where a(n) = 0 or a(n) = 1, for all n € N. Such sequences can 
be used to identify the elements in the Cantor set. 

Each point of the Cantor set lies in the intersection of an infinite nested family 
of closed intervals. To illustrate this assertion, let a = (1,0,1,0,...) be a specific 
infinite binary sequence. Think of the number 1 as representing “the right subinterval” 
and 0 as “the left subinterval.” Using the sequence a, we can define a nested set of 
closed intervals in stages. This can be viewed in Figure B.2. We begin by letting 
If = [0,1]. Since a(1) = 1, we let I$ = [2,1] which is the right subinterval of [?. 
As a(2) = 0, we next select I$ = [2,4], the left subinterval of I$. Since a(3) = 1, 
let [7 = (22, 4], the right sabia of J3'. Continuing in this way, we obtain the 
ae set Z° = {I? : n € N} of closed intervals where each interval [¢ has length 
. Since [° C K,, for all n € N, we see that (.\Z* C [.)K = C. We know by 
Teen 2.5.2 that (\Z° 4 @. In fact, (.)Z° has exactly one element which we will 
denote by ca. Thus, a acts as a sequence of instructions which allows one to acquire 
a point cq in C, the Cantor set. 

So any infinite infinite binary sequence a@ can be viewed as an address to an 
element cq € C, and the associated set of closed intervals Z* = {[* : n € N} can also 
be viewed as a road map by which to locate c,. Moreover, if a and 6 are two distinct 
infinite binary sequences, then c, # cg. In addition, for each point c € C there is 
an infinite binary sequence a such that c = cg. These observations imply that C is 
uncountable and that C has the same cardinality as R [2, Chapter 5]. So the Cantor 
set C contains no intervals, has zero length and yet, it has as many points as the 
set of all real numbers. One can also show the every c € C is an accumulation point 
of C (see Exercise 9). So C has no isolated points. These are a few of the paradoxical 
properties that are possessed by the Cantor set. 


Exercises B.3 


1. Let K = {A, : n € N} bea nested family of nonempty sets; that is, A, A @ and 
Anti © An for alln EN. 


(a) Show that K has the finite intersection property. 
(b) For all n EN, let A, = (0,4). Show that K = {A, : n € N} is a nested 


family of nonempty sets. By (a), K has the finite intersection property. Now 
evaluate ()K. Does your answer contradict Theorem B.3.3? 
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*9. 


. Let f: D— R be continuous where D is compact. Prove that f is bounded. 


. Let f: D — R be continuous where D 4 @ is compact. Prove that there is a 


c € D such that f(x) < f(c) for all x € D. 


. Let f: D > R be continuous and D is compact. Let G be a nonempty set of 


closed subsets of f[D]. Suppose that G has the finite intersection property. Prove 
that there exists a c € D such that f(c) € 1G. 


. Let a be an infinite binary sequence and let Z* = {[¢ : n € N} be the associated 


nested set of closed intervals (see Section B.3.2). Let (s,) be a sequence of real 
numbers such that s, € [® for all n € N. Prove that (s,,) is a Cauchy sequence. 


. Let a be an infinite binary sequence and let Z* = {I¢ : n € N} be the associated 


nested set of closed intervals (see Section B.3.2). Show that there cannot be two 
distinct points in ()Z°. 


. Let a and £ are two distinct infinite binary sequences. Show that cq 4 cg. 


. Let c € C, the Cantor set. Inductively define an infinite binary sequence @ such 


that c € I? for all n € N. Conclude that c= cg. 


Let c € C, the Cantor set. Let a be such that c = cy. Let ¢ > 0 and n € N be 


such that a4 < e. Prove that USC is infinite. 


APPENDIX C 


Review of Proof and Logic 


LOGIC: THE BASIS FOR PROOFS 
Some Logical Connectives 
Given a list of “propositions” or statements P,Q, R,... we can form sentences using 


the logical connectives A, V,7. For example, 


1. PAQ (means “P and Q” and is called a conjunction). 
2. PV Q (means “P or Q” and is called a disjunction). 


3. =P (means “not P” and is called a negation). 


Using these connectives as building blocks, one can construct more complex sentences, 
for example, (P A =Q) V (AS A R). 


Truth Tables 


The truth value of a sentence of the propositional logic can be evaluated from the 
truth value of its components. We shall explain what this “means” by using truth 
tables. The above logical connectives have the natural truth values given by the 
following tables, where 7’ means “true” and F’ means “false.” 


P 

oo 7 hee eal ae P|-=P 
oF || Te Fl | ae 

Fe AEN) | Fume bial me Fell 

F F| F F F\| F Bo 


Logical Equivalence 


Definition. Let ~ and y be sentences of propositional logic. Then 7 and y are 
logically equivalent, denoted by w = y, if they are both true at the same time 
and both false at the same time. 


Example. Let ~ be the sentence =(PV Q) and let vy be the sentence =P \ =Q. Show 
that ~ and ¢ are logically equivalent, that is, show =(P VQ) @= =P A -7Q. 
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Solution. It is sufficient to show that truth values of =(P V Q) and =P A =Q are 
always the same. This is done by constructing the following parallel truth tables: 


Truth table for =(P V Q): Truth table for =P A -=Q: 
POOPY O | SEPN OQ} POG) | SP a? | a AA: 
T T T P T T| F | F P 
T F T P T F| F | T P 
F T T P FT| T | F P 
FF F T FF) T | T T 
Step #] 1 1 2 3 Step #]/1 11] 2 2 3 


Since the final columns of the truth tables for =(P V Q) and =P A 7Q are the same, 
we conclude that =(P VQ) @ AP A -7=Q. 


The Conditional and Biconditional Connectives 


The Conditional Connective. Given two propositions P and Q, the conditional 
connective — means “implies,” and can be used to form the sentence P > Q. This 
sentence can be read as “if P, then Q.” Given two propositions P and Q, the sentence 
P—> Q has the following truth table: 


P Q|P73Q 
T T T 
T F F 
FT T 
F F T 


Definition. Let P and Q be statements. The formula =-Q — —P is called the con- 
trapositive of P > Q. 


Definition. Let P and Q be statements. The formula Q — P is called the converse 
of P> Q. 


Note: the two statements P > Q and Q > P are not equivalent. This can be 
shown by comparing their truth tables: 


P Q|P73Q P Q|Q->P 
aes T T T T 
T F F T F E 
F T T FT F 
F F T FO F T 


The Biconditional Connective. Given two propositions P and Q, the bicondi- 
tional connective + means “if and only if,” and can be used to form the sentence 
PQ. This new connective has the following truth table: 


P Q\|PSQ 
T T T 
T F PF 
FT F 
F OF T 
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Some Propositional Logic Laws 
DeMorgan’s Laws 

LAP VOY SS Ara 

2. a(P AO) S aP Vag 
Conditional Laws 

1. (P3>Q) S$ (-PVQ) 

2. (P>Q)<-(PA-7Q) 
Contrapositive Law 


1. (P > Q) S$ (-Q > =P). 


Quantifiers 


The quantifier V means “for all” and is called the universal quantifier. The quantifier 
4 means “there exists” and is called the existential quantifier. For example, we can 
form the sentences 


1. VzP(x) [means “for all x, P(x)”]. 


2. 4vP(x) [means “there exists an x such that P(x)”]. 


Quantifier Negation Laws 


be Sate Ptr) Vea P(e): 
2, ne dr5P(2). 
( 


Definition (Bounded Quantifiers). We write (Vz € A) P(x) to mean that for every 
x in A, P(x) is true. Also (Ax € A) P(x) means that for some x in A, P(x) is true. 


Bounded Quantifier Negation Laws 


1. a 


—— 


da € A) P(x) @ (Vz € A) AP(z). 
(Vz € A) P(x) & (ar € A) AP(z). 


Definition (Bounded Number Quantifiers). We write (Vx < a)P(x) to mean that 
for every number x <a, P(x) is true. We also write (dz < a) P(x) to assert that for 
some number x <a, P(x) is true. 


Negation Laws for Bounded Number Quantifiers 


a(Va Sn) Pin) (ar Sa Pigk 2. 4Ve< a) Play S (aa = a)aPt): 
2. ola Sn) Pe) Wa Saale). 4,aGe a) Pa) ae <aaP (ae), 


These laws also hold for <, >, >. 
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PROOF STRATEGIES 


1. To prove an algebraic equation, try one of the following: 


(a) Transform one side of the equation into the other side of the equation. 


(b) Derive the equation from any previously given, or assumed, equations. 


2. To prove P > Q, try one of the following: 


(a) Assume P 
Prove Q. 

(b) Assume =Q 
Prove =P. 


3. To prove P A Q, try the following: 


Prove P 
Prove Q. 


4. To prove P V Q, try one of the following: 


(a) Assume —P 
Prove Q. 
(b) Assume 7=Q 
Prove P. 


(c) Try using a division by cases. In each case, prove P or prove Q. 


OU 


. To prove P & Q, try the following: 


Prove P+ Q 
Prove Q > P. 


fo) 


. To prove P by contradiction: 


Assume =P 


Derive “a contradiction.” 
7. To prove VrP(x), or (Vx € A)P(x): 


Let x, or respectively x € A, be arbitrary. Now prove P(z). 


8. To prove drP(x), or (Ax € A)P(x), try one of the following: 


(a) Let x = (the value you found) 
Prove P(z). 


(b) Prove P(x) for some x, or respectively for some x € A. 
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ASSUMPTION STRATEGIES 
1. When assuming P > Q: 


(a) If you are assuming or can prove P, then you can conclude Q. 


(b) If you are assuming or can prove 7Q, then you can conclude =P. 


2. When assuming P V Q: 


(a) If required to prove R, try the following division by cases, 


Case 1: Assume P. 
Prove R. 

Case 2: Assume Q. 
Prove R. 


(b) If you are assuming or can prove —P, then you can conclude Q. 


(c) If you are assuming or can prove 7Q, then you can conclude P. 
. When assuming P A Q: 

You can assume P and assume Q. 

. When assuming P © Q: 


(a) If you are assuming or can prove P, then you can conclude Q. 


(b) If you are assuming or can prove Q, then you can conclude P. 


. When assuming —P: 


(a) Ina proof by contradiction try to prove P and thereby derive a contradiction. 


(b) Reexpress —P as a positive statement, and try to use the positive statement. 


. When assuming VzP(x), or (Vx € A) P(z): 


Take any useful value for x, or respectively x € A; say a, and assume P(a). 


. When assuming 4xP(x), or (Ax € A) P(x): 


Introduce a new term 20, or respectively xo € A, and assume P(9). 


8. Any established theorem can be assumed and used in a proof. 


Well-Ordering and Induction Proof Strategies 


Let b be a fixed integer and let n be an integer variable. 


1. To prove (Vn > b) P(n) by the well-ordering principle, use: 


Assume that —P(n) holds for some integer n > b. 
Let N > b be the smallest such integer satisfying —P(N). 
Derive “a contradiction.” 
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2. To prove (Vn > b) P(n) by mathematical induction, use: 


Base step: Prove P(b). 
Inductive step: Let n > b be an integer. 
Assume P(n). 
Prove P(n + 1). 


3. To prove (Vn > b) P(n) by strong induction with one base step, use: 


Base step: Prove P(b). 
Inductive step: Let n > b be an integer. 
Assume P(k) whenever b < k <n. 
Prove P(n). 


4. To prove (Vn > b) P(n) by strong induction with multiple base steps, identify 
the integer c > b and use: 


Base step: Prove P(b). 
Base step: Prove P(b+ 1). 
Base step: Prove P(c). 
Inductive step: Let n > c be an integer. 
Assume P(k) whenever b < k <n. 
Prove P(n). 


Proof and Assumption Strategies for Set Theory 


1. To prove A C B, use the form: 


Let cE A. 
Prove x € B. 


2. To prove A = B, try one of the following: 


(a) Prove AC B 
Prove B C A. 
(b) Let x be arbitrary. 


ProveexE€ AYE B. 
3. When assuming A C B, if you know or can prove x € A, then you can conclude 
x € B. If you know or can prove x ¢ B, then you can conclude x ¢ A. 


4. When assuming A = B, if you know or can prove x € A, then you can conclude 
x € B. If you know or can prove x ¢ A, then you can conclude x ¢ B. 
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Proof and Assumption Strategies for Functions 


1. To prove f = g where f: A— Band g: A- B, use: 


Let c € A. 
Prove f(x) = g(z). 


2. To prove that a function f: A — B is one-to-one, use: 


Let x€ Aand ye A. 
Assume f(x) = f(y). 


Prove x = y. 
3. To prove that a function f: A > B is onto, use: 


Let y € B. 
Let x = (the element in A you found). 
Prove f(x) = y. 


4. When assuming f: A > B is one-to-one. If you are also assuming or can prove 
that f(x) = f(y), then you can conclude that x = y whenever x,y € A. 


5. When assuming f: A — B is onto, then for any y € B you can conclude that 
f(x) = y for some z € A. 
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List of Symbols 


oo, 3 

fie Mae 10 
#A, 10 

Cc, 10 

Cc, 10 

{C,:1€ I}, 12 


WF 14 


inf(S), 37 
sup(S), 37 


max, 38 


kS, 39 

>, 50 

(Sn), 55 

lim s, = ¢, 57 

noo 

U*, 64 

Uz, 64 

lim sup Zp, 93 
noo 

lim inf 7,, 96 
noo 

(f +g), 103 

(f —g), 103 


(fg), 103 
(kf), 103 


lim f (2) = oo, 116 


f’, 130 

iff, 138 

I*, 145 

f”, 148 
f™, 148 
tf, Py; 152 
M;(f), 152 
P, 152 

U(f, P), 152 
Ax;, 152 


mi(f), 152 

Ye aj, 152 
Say 150 

it 

L(f), 156 
U(f), 156 

fo f, 157 

f° f(x) dx, 157 


V(f,P, [c, dl), 172 


V(f,[e,d]), 173 
for, 184 
Seco 9 184 
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S* ap, 189 
k=1 


>> ax, 189 
k=1 
lim sup xR = 00, 224 
N—- Oo 
p > 0, 226 
bd(S), 239 
int(S), 239 
4 2a 
(Va <a), (Agr < a), 255 


Index 


absolute convergence, 201 
absolute value, 33 

basic properties, 33 

definition, 33 

more properties, 34 
accumulation point, 242 
alternating series, 199 
alternating series test I, 199 
antiderivative, 177 
antidifferentiation 

by inverse substitution, 182 

by parts, 181 

by substitution, 181 
Archimedean property, 47 
ascending prime factorization, 23 
associate law, 30 


backward triangle inequality, 7, 34 
Bernoulli’s inequality, 27, 64, 72 
bijection, 17 
binomial coefficient, 26 
binomial theorem, 26 
Bolzano—Weierstrass Theorem 

for sequences, 85 

for sets, 85 

via compactness, 248 
boundary point, 239 
bounded 

function, 41 

sequence, 66 

set, 36 
bounded above, 36 

sequence, 65 
bounded below, 36 

sequence, 65 
bounded number quantifiers, 255 
bounded quantifier negation laws, 255 
bounded variation, 172 


Cantor set, 249 


cardinality, 51 
Cauchy Convergence Criterion, 88 
for series, 194 
for uniform convergence, 216 
Cauchy Mean Value Theorem, 139 
chain rule, 133 
closed interval, 2 
closed set, 240 
closure, 242 
commutative law, 30 
compact, 244 
Completeness Axiom, 38 
composite function, 17 
composition, 17 
Composition Theorem, 168, 235 
conditional convergence, 201 
constant multiple function, 103 
continuous function 
at a point, 101 
on its domain, 101 
sequential criterion, 110 
uniformly, 124 
countable, 51 


Darboux sum, 152 

lower, 152 

upper, 152 
Darboux, Jean Gaston, 143, 151 
De Morgan’s laws 

for sets, 13, 14 
dense set, 48 
derivative, 130 

higher order, 148 
difference function, 103 
difference quotient, 129 
differentiation rules, 131 
Dini’s Theorem, 220 
direct comparison test, 196 
distributive law, 30 
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divergence test, 192 


equations, how to prove, 3 
Euler’s sequence, 82 

even function, 16 

Extreme Value Theorem, 120 


factorial, 25 
family of sets, 12 
field axioms, 30 
finite intersection property, 249 
finite subcover, 244 
Fixed Point Theorem, 123 
function, 15 
attains a maximum value, 120 
attains a minimum value, 120 
bijective, 17 
bounded, 41 
co-domain, 15 
composite, 17 
constant, 16 
constant multiple, 103 
continuous, 101 
continuous at a point, 101 
decreasing, 138 
difference, 103 
difference quotient, 129 
differentiable at a point, 130 
differentiable on a set, 130 
domain, 15 
even, 16 
identity, 18 
image of a set, 19 
increasing, 138 
infinite limit at a point, 116 
injective, 17 
inverse, 18 
inverse image of a set, 19 
inverse of a 
continuous function, 144 
differentiable function, 145 
limit at a point, 112 
limit theorem, 115 
monotone, 144 
n-times differentiable, 148 


odd, 16 
of bounded variation, 172 
one-to-one, 16 
onto, 17 
polynomial, 16 
preserves convergent sequences, 126 
product, 103 
quotient, 103 
range, 15 
rational, 16 
real-valued, 16 
reciprocal, 105 
second derivative, 148 
single-valued, 15 
strictly decreasing, 138 
strictly increasing, 138 
strictly monotone, 144 
sum, 103 
surjective, 17 
twice differentiable, 148 
uniformly continuous, 124 
zero, 16 
fundamental theorem 
of arithmetic, 23 
Fundamental Theorem of Calculus I, 178 
Fundamental Theorem of Calculus II, 179 


gap, 38 

geometric series, 193 

geometric summation, 24 

greatest lower bound, see infimum 


harmonic series, 192 
Heine—Borel Theorem, 247 
higher order derivatives, 148 


identity function, 18 
improper integral, 184 
increasing sequence of functions, 219 
index set, 11 
indexed family of sets, 12 
inductively defined sequence, 26 
inequality 

how to prove, 4 

laws of, 4 

quotient principles, 57 


sum and product principles, 5 
infimum 
assumption strategy, 40, 45 
definition, 37 
proof strategy, 40, 45 
infinite limit 
of a function, 116 
of a sequence, 91 
infinite series, 189 
absolute convergence, 201 
alternating series test, 199 
Cauchy criterion, 194 
conditional convergence, 201 
direct comparison test, 196 
divergence test, 192 
of functions, 216 
integral test, 198 
limit comparison test, 197 
p-series test, 198 
Ratio Test I, 201 
Ratio Test II, 202 
rearrangement, 207 
regrouping, 206 
Root Test I, 203 
Root Test II, 203 
tail, 194 
tail test, 194 
terms of, 189 
infinitely differentiable, 228 
injection, 16 
integers, | 
integrable, 157 
integrable over I, 180 
integral, 157 
integral test, 198 
integrand, 184 
integration 
by inverse substitution, 182 
by parts, 181 
by substitution, 181 
interior point, 239 
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Intermediate Value Theorem 
for continuous functions, 122 
for derivatives, 143 

interval, 2 
endpoints, 2 
interior point, 2 
left endpoint, 2 
right endpoint, 2 

interval of convergence, 226 

inverse of a 
continuous function, 144 
differentiable function, 145 


L’Hopital’s Rule I, 139 
L’Hopital’s Rule I, 140 
L’Hopital’s Rule HI, 141 
L’Hopital’s Rule IV, 148 
least upper bound, see supremum 
limit comparison test, 197 
limit function, 213 
limit inferior, 96 
limit of a function, 112 

infinite, 116 

at infinity, 117 

one-sided, 115 

proof strategy, 112 

sequential criterion, 114 
limit of a sequence, see sequence 
limit superior, 93 
limit theorems for sequences, 68 
lower bound, 36 
lower integral, 156 


Maclaurin series, 230 
mathematical induction 
proof strategy, 23 
maximum, 34, 38 
Mean Value Theorem, 137 
Cauchy, 139 
for integrals, 171 
minimum, 34, 38 


Monotone Convergence Theorem, 81 


Taeemecay Hor Deny Monotone Subsequence Theorem, 83 


for continuous functions, 120 
for derivatives, 143 
intermediate value property, 121 


natural exponential function, 187 
natural logarithm function, 187 
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natural numbers, 1 
neighborhood, 64 

nested closed intervals, 50 
Nested Intervals Theorem, 50 
nested sets, 51 


odd function, 16 

one-sided limits, 115 
left-hand limit, 116 
right-hand limit, 116 

open cover, 244 

open interval, 2 

open set, 240 

order axioms, 32 


ordered field, 32 


p-series, 198 

P-variation, 172 

partial sum, 189 

partial sum function, 216 

partition, 152 

peak, 83 

point, 2 

pointwise convergence, 213 
series, 216 

power series, 224 
coefficients, 224 
interval of convergence, 226 
Maclaurin series, 230 
radius of convergence, 226 
representation, 230 
Taylor series, 230 

product function, 103 

proof strategies 
Cauchy sequence, 87 
continuity at a point, 102 
continuous function, 108 
equations, 3 
functions, 259 
induction, 23, 258 
limit of a function at a point, 112 
limit of a sequence, 57 
Riemann integrable, 159 
set theory, 258 
uniform continuity, 124 


well-ordering, 257 


quantifier negation laws, 255 
quantifiers 

number bounded, 255 
quotient function, 103 


radius of convergence, 226 
Ratio Test I, 201 

Ratio Test II, 202 

rational numbers, 1 

real analysis, 55 

real numbers, 1 

real-valued function, 16 
rearrangement of a series, 207 
reciprocal function, 105 
refinement of a partition, 152 
Refinement Theorem, 154 
regrouping of a series, 206 
restricted truth set, 2 
Riemann integrable, 157 


necessary and sufficient condition, 158 


Riemann integral, 156 
Rolle’s Theorem, 136 
Root Test I, 203 
Root Test I, 203 


sequence, 55 
binary, 250 
bounded, 66 
bounded above, 65 
bounded below, 65 
Cauchy, 87 
proof strategy, 87 
constant, 55 
convergent, 56 
decreasing, 79 
diverges, 57 
diverges to +00, 91 
increasing, 79 
limit of, 57 
assumption strategy, 60 
proof strategy, 57 
limit theorems 
algebraic, 68 
of distinct points, 55 


of functions, 213 
monotone, 79 
order, 73 
subsequence, 75 
tail-end, 77 
terms of, 55 
unbounded, 66 
series, see infinite series 
set equality 
definition, 10 
set operations 
difference, 10 
intersection 
of indexed family, 13 
of two sets, 10 
union 
of indexed family, 13 
of two sets, 10 
square root, 6 
squeeze theorem 
for functions, 115 
for sequences, 72 
subsequence, see sequence 
subsequential limit, 95 
subset relation 
definition, 10 
sum function, 103 
summand, 5 


summation notation, 24, 152 


» notation, 24, 152 
shift rule, 25 
supremum 


assumption strategy, 39, 44 


definition, 37 
proof strategy, 39, 44 
surjection, 17 


tail of a series, 194 
tail convergence test, 194 
tail-end of a sequence, 77 


Taylor polynomial, 149, 231 
remainder term, 149, 231 


Taylor series, 230 
Taylor’s Theorem, 149 
telescoping series, 191 
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telescoping sum, 27 

terms of an infinite series, 189 
terms of sequence, 55 
transitive law, 32 

triangle inequality, 33 
trichotomy law, 32 

truth set, 2 


unbounded, 37 

uncountable, 51 

uniform continuity, 124 

uniform convergence, 214 
assumption strategy, 215 
proof strategy, 215 
series, 216 

uniformly Cauchy, 216 

upper bound, 36 

upper integral, 156 


variation, 173 
Venn diagram, 10 


Weierstrass M-test, 216 
Well-Ordering Principle, 23 


